Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.
The widespread adoption of RAD-Seq data in phylogeography means genealogical relationships previously evaluated using relatively few genetic markers can now be addressed with thousands of loci. One challenge, however, is that RAD-Seq generates complete genotypes for only a small subset of loci or individuals. Simulations indicate that loci with missing data can produce biased estimates of key population genetic parameters, although the influence of such biases in empirical studies is not well understood. Here we compare microsatellite data (8 loci) and RAD-Seq data (six datasets ranging from 239 to 25,198 loci) from red mangroves (Rhizophora mangle) in Florida to evaluate how different levels of data filtering influence phylogeographic inferences. For all datasets, we calculated population genetic statistics and evaluated population structure, and for RAD-Seq datasets, we additionally examined population structure using coalescence. We found higher F ST using microsatellites, but that RAD-Seq-based estimates approached those based on microsatellites as more loci with more missing data were included. Analyses of RAD-Seq datasets resolved the classic Gulf-Atlantic coastal phylogeographic break, which was not significant in the microsatellite analyses. Applying multiple levels of filtering to RAD-Seq datasets can provide a more complete picture of potential biases in the data and elucidate subtle phylogeographic patterns.
Phylogenomic evidence from an increasing number of studies has demonstrated that different data sets and analytical approaches often reconstruct strongly supported but conflicting relationships. In this study, 785 single‐copy nuclear genes and 75 complete plastomes were used to infer the phylogenetic relationships and estimate the historical biogeography of the apple genus Malus sensu lato, an economically important lineage disjunctly distributed in the Northern Hemisphere and involved in known and suspected hybridization and allopolyploidy events. The nuclear phylogeny recovered the monophyly of Malus s.l. (including Docynia); however, the genus was supported to be biphyletic in the plastid phylogeny. An ancient chloroplast capture event in the Eocene in western North America best explains the cytonuclear discordance. Our conflict analysis demonstrated that ILS, hybridization, and allopolyploidy could explain the widespread nuclear gene tree discordance. One deep hybridization event (Malus doumeri) and one recent event (Malus coronaria) were detected in Malus s.l. Furthermore, our historical biogeographic analysis integrating living and fossil data supported a widespread East Asian‐western North American origin of Malus s.l. in the Eocene, followed by several extinction and dispersal events in the Northern Hemisphere. We also propose a general workflow for assessing phylogenomic discordance and biogeographic analysis using deep genome skimming data sets.
With the decreasing cost and availability of many newly developed bioinformatics pipelines, next‐generation sequencing (NGS) has revolutionized plant systematics in recent years. Genome skimming has been widely used to obtain high‐copy fractions of the genomes, including plastomes, mitochondrial DNA (mtDNA), and nuclear ribosomal DNA (nrDNA). In this study, through simulations, we evaluated the optimal (minimum) sequencing depth and performance for recovering single‐copy nuclear genes (SCNs) from genome skimming data, by subsampling genome resequencing data and generating 10 data sets with different sequencing coverage in silico. We tested the performance of four data sets (plastome, nrDNA, mtDNA, and SCNs) obtained from genome skimming based on phylogenetic analyses of the Vitis clade at the genus level and Vitaceae at the family level, respectively. Our results showed that optimal minimum sequencing depth for high‐quality SCNs assembly via genome skimming was about 10× coverage. Without the steps of synthesizing baits and enrichment experiments, coupled with incredibly low sequencing costs, we showcase that deep genome skimming (DGS) is as effective for capturing large data sets of SCNs as the widely used Hyb‐Seq approach, in addition to capturing plastomes, mtDNA, and entire nrDNA repeats. DGS may serve as an efficient and economical alternative and may be superior to the popular target enrichment/Hyb‐Seq approach.
A central goal of comparative phylogeography is to understand how species‐specific traits interact with geomorphological history to govern the geographic distribution of genetic variation within species. One key biotic trait with an immense impact on the spatial patterns of intraspecific genetic differentiation is dispersal. Here, we quantify how species‐specific traits directly related to dispersal affect genetic variation in terrestrial organisms with adaptations for dispersal by sea, not land—the mangroves of the Caribbean. We investigate the phylogeography of white mangroves (Laguncularia racemosa, Combretaceae) and red mangroves (Rhizophora mangle, Rhizophoraceae) using chloroplast genomes and nuclear markers (thousands of RAD‐Seq loci) from individuals throughout the Caribbean. Both coastal tree species have viviparous propagules that can float in salt water for months, meaning they are capable of dispersing long distances. Spatially explicit tests of the role of ocean currents on patterning genetic diversity revealed that ocean currents act as a mechanism for facilitating dispersal, but other means of moving genetic material are also important. We measured pollen‐ vs. propagule‐mediated gene flow and discovered that in white mangroves, seeds were more important for promoting genetic connectivity between populations, but in red mangroves, the opposite was true: pollen contributed more. This result challenges our concept of the importance of both proximity to ocean currents for moving mangrove seeds and the extent of long‐distance pollen dispersal. This study also highlights the importance of spatially explicit quantification of both abiotic (ocean currents) and biotic (dispersal) factors contributing to gene flow to understand fully the phylogeographic histories of species.
Background and Aims Many monotypic gymnosperm lineages in south-east China paradoxically remain in relict status despite long evolutionary histories and ample opportunities for allopatric speciation, but this paradox has received little attention and has yet to be resolved. Here, we address this issue by investigating the evolutionary history of a relict conifer, Pseudotaxus chienii (Taxaceae). Methods DNA sequences from two chloroplast regions and 14 nuclear loci were obtained for 134 samples. The demographic history was inferred and the contribution of isolation by environment (IBE) in patterning genetic divergence was compared with that of isolation by distance (IBD). Key Results Three genetic clusters were identified. Approximate Bayesian computation analyses showed that the three clusters diverged in the late Pliocene (~3.68 Ma) and two admixture events were detected. Asymmetric gene flow and similar population divergence times (~ 3.74 Ma) were characterized using the isolation with migration model. Neither IBD nor IBE contributed significantly to genetic divergence, and the contribution of IBE was much smaller than that of IBD. Conclusions These results suggest that several monotypic relict gymnosperm lineages like P. chienii in south-east China did not remain in situ and undiversified for millions of years. On the contrary, they have been evolving and the extant populations have become established more recently, having insufficient time to speciate. Our findings provide a new perspective for understanding the formation and evolution of the relict gymnosperm flora of China as well as of the Sino-Japanese Flora.
Premise of the study:The One Thousand Plant Transcriptomes Project (1KP, 1000+ assembled plant transcriptomes) provides an enormous resource for developing microsatellite loci across the plant tree of life. We developed loci from these transcriptomes and tested their utility.Methods and Results:Using software packages and custom scripts, we identified microsatellite loci in 1KP transcriptomes. We assessed the potential for cross-amplification and whether loci were biased toward exons, as compared to markers derived from genomic DNA. We characterized over 5.7 million simple sequence repeat (SSR) loci from 1334 plant transcriptomes. Eighteen percent of loci substantially overlapped with open reading frames (ORFs), and electronic PCR revealed that over half the loci would amplify successfully in conspecific taxa. Transcriptomic SSRs were approximately three times more likely to map to translated regions than genomic SSRs.Conclusions:We believe microsatellites still have a place in the genomic age—they remain effective and cost-efficient markers. The loci presented here are a valuable resource for researchers.
The high dispersal capability of these species may lead to high genetic connectivity between sampling locations and little geographic structure. We also identified several locations that, based on genetic data, should be the focus of conservation efforts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.