One of the most common questions asked before starting a new population genetic study using microsatellite allele frequencies is “how many individuals do I need to sample from each population?” This question has previously been answered by addressing how many individuals are needed to detect all of the alleles present in a population (i.e. rarefaction based analyses). However, we argue that obtaining accurate allele frequencies and accurate estimates of diversity are much more important than detecting all of the alleles, given that very rare alleles (i.e. new mutations) are not very informative for assessing genetic diversity within a population or genetic structure among populations. Here we present a comparison of allele frequencies, expected heterozygosities and genetic distances between real and simulated populations by randomly subsampling 5–100 individuals from four empirical microsatellite genotype datasets (Formica lugubris, Sciurus vulgaris, Thalassarche melanophris, and Himantopus novaezelandia) to create 100 replicate datasets at each sample size. Despite differences in taxon (two birds, one mammal, one insect), population size, number of loci and polymorphism across loci, the degree of differences between simulated and empirical dataset allele frequencies, expected heterozygosities and pairwise FST values were almost identical among the four datasets at each sample size. Variability in allele frequency and expected heterozygosity among replicates decreased with increasing sample size, but these decreases were minimal above sample sizes of 25 to 30. Therefore, there appears to be little benefit in sampling more than 25 to 30 individuals per population for population genetic studies based on microsatellite allele frequencies.
Conservation management strategies for many highly threatened species include conservation breeding to prevent extinction and enhance recovery. Pairing decisions for these conservation breeding programmes can be informed by pedigree data to minimize relatedness between individuals in an effort to avoid inbreeding, maximize diversity and maintain evolutionary potential. However, conservation breeding programmes struggle to use this approach when pedigrees are shallow or incomplete.While genetic data (i.e., microsatellites) can be used to estimate relatedness to inform pairing decisions, emerging evidence indicates this approach may lack precision in genetically depauperate species, and more effective estimates will likely be obtained from genomic data (i.e., thousands of genome-wide single nucleotide polymorphisms, or SNPs). Here, we compare relatedness estimates and subsequent pairing decisions using pedigrees, microsatellites and SNPs from whole-genome resequencing approaches in two critically endangered birds endemic to New Zealand: kakī/ black stilt (Himantopus novaezelandiae) and kākāriki karaka/orange-fronted parakeet (Cyanoramphus malherbi). Our findings indicate that SNPs provide more precise estimates of relatedness than microsatellites when assessing empirical parent-offspring and full sibling relationships. Further, our results show that relatedness estimates and subsequent pairing recommendations using PMx are most similar between pedigreeand SNP-based approaches. These combined results indicate that in lieu of robust pedigrees, SNPs are an effective tool for informing pairing decisions, which has important implications for many poorly pedigreed conservation breeding programmes worldwide.
|GALLA et AL.
Hybridization facilitated by human activities has dramatically altered the evolutionary trajectories of threatened taxa around the globe. Whereas introduced mammalian predators and widespread habitat loss and degradation clearly imperil the recovery and survival of the New Zealand endemic black stilt or kakī (Himantopus novaezelandiae), the risk associated with hybridization between this critically endangered endemic and its self-introduced congener, the pied stilt or poaka (Himantopus himantopus leucocephalus) is less clear. Here, we combine Bayesian admixture analyses of microsatellite data with mitochondrial DNA sequence data to assess the levels of hybridization and introgression between kakī and poaka. We show that birds classified as hybrids on the basis of adult plumage are indeed of hybrid origin and that hybridization between kakī and poaka is both extensive and bidirectional. Despite this, we found almost no evidence for introgression from poaka to kakī, thus negating the popular belief that kakī represent a hybrid swarm. To our knowledge, ours represents the first comprehensive study to document a lack of widespread introgression for a species at risk despite a recent history of extensive bidirectional human-induced hybridization. We attribute this rather surprising result, in part, to reduced reproductive success in female hybrids combined with a transient male-biased kakī sex ratio. To maximize the evolutionary potential of kakī, we use these data to recommend conservation management activities aimed to maintain the genetic integrity and to maximize the genetic diversity of this iconic rare bird.
Threatened species recovery programmes benefit from incorporating genomic data into conservation management strategies to enhance species recovery. However, a lack of readily available genomic resources, including conspecific reference genomes, often limits the inclusion of genomic data. Here, we investigate the utility of closely related high-quality reference genomes for single nucleotide polymorphism (SNP) discovery using the critically endangered kakī/black stilt (Himantopus novaezelandiae) and four Charadriiform reference genomes as proof of concept. We compare diversity estimates (i.e., nucleotide diversity, individual heterozygosity, and relatedness) based on kakī SNPs discovered from genotyping-by-sequencing and whole genome resequencing reads mapped to conordinal (killdeer, Charadrius vociferus), confamilial (pied avocet, Recurvirostra avosetta), congeneric (pied stilt, Himantopus himantopus) and conspecific reference genomes. Results indicate that diversity estimates calculated from SNPs discovered using closely related reference genomes correlate significantly with estimates calculated from SNPs discovered using a conspecific genome. Congeneric and confamilial references provide higher correlations and more similar measures of nucleotide diversity, individual heterozygosity, and relatedness. While conspecific genomes may be necessary to address other questions in conservation, SNP discovery using high-quality reference genomes of closely related species is a cost-effective approach for estimating diversity measures in threatened species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.