The accelerating rate at which DNA sequence data are now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by these sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine "microhaplotypes" and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false-positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full-sibling and single parent-offspring pair identification. We also show how the identification of half-sibling pairs requires so much data that physical linkage becomes a consideration, and that most published studies that attempt to do so are dramatically underpowered. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.
Marine species with pelagic larvae typically exhibit little population structure, suggesting long‐distance dispersal and high gene flow. Directly quantifying dispersal of marine fishes is challenging but important, particularly for the design of marine protected areas (MPAs). Here, we studied kelp rockfish (Sebastes atrovirens) sampled along ~25 km of coastline in a boundary current‐dominated ecosystem and used genetic parentage analysis to identify dispersal events and characterize them, because the distance between sedentary parents and their settled offspring is the lifetime dispersal distance. Large sample sizes and intensive sampling are critical for increasing the likelihood of detecting parent–offspring matches in such systems and we sampled more than 6,000 kelp rockfish and analysed them with a powerful set of 96 microhaplotype markers. We identified eight parent–offspring pairs with high confidence, including two juvenile fish that were born inside MPAs and dispersed to areas outside MPAs, and four fish born in MPAs that dispersed to nearby MPAs. Additionally, we identified 25 full‐sibling pairs, which occurred throughout the sampling area and included all possible combinations of inferred dispersal trajectories. Intriguingly, these included two pairs of young‐of‐the‐year siblings with one member each sampled in consecutive years. These sibling pairs suggest monogamy, either intentional or accidental, which has not been previously demonstrated in rockfishes. This study provides the first direct observation of larval dispersal events in a current‐dominated ecosystem and direct evidence that larvae produced within MPAs are exported both to neighbouring MPAs and to proximate areas where harvest is allowed.
Freshwater habitat alteration and marine fisheries can affect anadromous fish species, and populations fluctuating in size elicit conservation concern and coordinated management. We describe the development and characterization of two sets of 96 single nucleotide polymorphism (SNP) assays for two species of anadromous alosine fishes, alewife and blueback herring (collectively known as river herring), that are native to the Atlantic coast of North America. We used data from high‐throughput DNA sequencing to discover SNPs and then developed molecular genetic assays for genotyping sets of 96 individual loci in each species. The two sets of assays were validated with multiple populations that encompass both the geographic range and the known regional genetic stocks of both species. The SNP panels developed herein accurately resolved the genetic stock structure for alewife and blueback herring that was previously identified using microsatellites and assigned individuals to regional stock of origin with high accuracy. These genetic markers, which generate data that are easily shared and combined, will greatly facilitate ongoing conservation and management of river herring including genetic assignment of marine caught individuals to stock of origin.
Nearshore marine habitats are critical for a variety of commercially important fish species, but assessing fish communities in these habitats is costly and time‐intensive. Here, we leverage eDNA metabarcoding to characterize nearshore fish communities near Juneau, Alaska, USA, a high‐latitude environment with large tidal swings, strong currents, and significant freshwater input. We investigated whether species richness and community composition differed across three habitat types (sand beaches, eelgrass beds, and rocky shorelines) and between high and low tides. Additionally, we tested whether replication of field samples and PCR reactions influenced either species richness or composition. We amplified a 12S mitochondrial locus in our samples and identified 167 fish amplicon sequence variants, which were grouped into 24 unique taxa based on sequence similarity, with approximately half of these taxa resolved to single species. Species richness and composition inferred from eDNA differed substantially among habitats, with rock habitats containing fewer taxa and fewer overall detections than sand and eelgrass habitats. The effect of tide was not significant on its own, but a significant habitat‐tide interaction was documented, with the most pronounced differences in taxa between tides found in sand habitats. Power analyses indicated that additional field sampling is useful to detect small changes in species richness such as those due to tide. PCR replicates typically identified few additional taxa. Our results provide important information that can be used to guide future studies, most notably, that the influence of tide on eDNA results appears to be minimal and potentially isolated to certain habitats. This suggests that replication across tides may not be vital for future eDNA studies and that additional replication across space—particularly across heterogeneous environments—likely is a better allocation of sampling effort.
DNA metabarcoding is used to enumerate and identify taxa in both
environmental samples and tissue mixtures. The composition and
resolution of metabarcoding data depend on the primer(s) used. Markers
that amplify different genes can mitigate biases in primer affinity,
amplification efficiency, and reference database resolution, but few
empirical studies have evaluated markers for complementary performance.
Here, we assess the individual and joint performance of 22 markers for
detecting species in a DNA pool of >100 species of
primarily marine and freshwater fishes, but also including
representatives of elasmobranchs, cephalopods, and crustaceans. Marker
performance includes the integrated effect of primer specificity and
reference availability. We find that a portfolio of four markers
targeting 12S, 16S, and multiple regions of COI identifies 100% of
reference taxa to family and nearly 60% to species. We then use the
four markers in this portfolio to evaluate metabarcoding of
heterogeneous tissue mixtures, using experimental fishmeal to test: 1)
the tissue input threshold to ensure detection; 2) how read depth scales
with tissue abundance; and 3) the effect of non-target material in the
mixture on recovery of target taxa. We consistently detect taxa that
make up >1% of fishmeal mixtures and can detect taxa at
the lowest input level of 0.01%, but rare taxa (<1%) were
detected inconsistently across markers and replicates. Read counts
showed weak correlation with tissue input, suggesting they are not a
valid proxy for relative abundance. Despite this limitation, our results
demonstrate the value of a primer portfolio approach—tailored to the
taxa of interest—for detecting and identifying both rare and abundant
species in heterogeneous tissue mixtures.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.