The tree of life of fishes is in a state of flux because we still lack a comprehensive phylogeny that includes all major groups. The situation is most critical for a large clade of spiny-finned fishes, traditionally referred to as percomorphs, whose uncertain relationships have plagued ichthyologists for over a century. Most of what we know about the higher-level relationships among fish lineages has been based on morphology, but rapid influx of molecular studies is changing many established systematic concepts. We report a comprehensive molecular phylogeny for bony fishes that includes representatives of all major lineages. DNA sequence data for 21 molecular markers (one mitochondrial and 20 nuclear genes) were collected for 1410 bony fish taxa, plus four tetrapod species and two chondrichthyan outgroups (total 1416 terminals). Bony fish diversity is represented by 1093 genera, 369 families, and all traditionally recognized orders. The maximum likelihood tree provides unprecedented resolution and high bootstrap support for most backbone nodes, defining for the first time a global phylogeny of fishes. The general structure of the tree is in agreement with expectations from previous morphological and molecular studies, but significant new clades arise. Most interestingly, the high degree of uncertainty among percomorphs is now resolved into nine well-supported supraordinal groups. The order Perciformes, considered by many a polyphyletic taxonomic waste basket, is defined for the first time as a monophyletic group in the global phylogeny. A new classification that reflects our phylogenetic hypothesis is proposed to facilitate communication about the newly found structure of the tree of life of fishes. Finally, the molecular phylogeny is calibrated using 60 fossil constraints to produce a comprehensive time tree. The new time-calibrated phylogeny will provide the basis for and stimulate new comparative studies to better understand the evolution of the amazing diversity of fishes.
Sequencing reduced-representation libraries of restriction site-associated DNA (RADseq) to identify single nucleotide polymorphisms (SNPs) is quickly becoming a standard methodology for molecular ecologists. Because of the scale of RADseq data sets, putative loci cannot be assessed individually, making the process of filtering noise and correctly identifying biologically meaningful signal more difficult. Artefacts introduced during library preparation and/or bioinformatic processing of SNP data can create patterns that are incorrectly interpreted as indicative of population structure or natural selection. Therefore, it is crucial to carefully consider types of errors that may be introduced during laboratory work and data processing, and how to minimize, detect and remove these errors. Here, we discuss issues inherent to RADseq methodologies that can result in artefacts during library preparation and locus reconstruction resulting in erroneous SNP calls and, ultimately, genotyping error. Further, we describe steps that can be implemented to create a rigorously filtered data set consisting of markers accurately representing independent loci and compare the effect of different combinations of filters on four RAD data sets. At last, we stress the importance of publishing raw sequence data along with final filtered data sets in addition to detailed documentation of filtering steps and quality control measures.
High species richness and evenness in structurally complex habitats has been hypothesized to be associated with niche partitioning. To test this idea, relationships between habitat structural complexity in river littoral-zone habitats and morphological diversity of tropical fishes were examined in the Cinaruco River, Venezuela. Six habitat attributes were quantified in 45 sites spanning a range of structural complexity. Fishes were collected during day and night to estimate species density and relative abundances at each site. Twenty-two morphological variables were measured for each species. Principal components analysis (PCA) of physical habitat data yielded two axes that modeled >80% of variation across sites. The first two axes from PCA of fish morphological variables modeled >70% of variation. Species density during both day and night was negatively associated with flow velocity and positively associated with habitat complexity. Similarity of day and night samples from the same site was significantly greater for sites with high habitat complexity and low flow. In general, mean local assemblage morphological PC scores were not significantly associated with habitat PC scores. Average, maximum, and standard deviation of morphological Euclidean distances of local assemblages revealed positive associations with structural complexity and negative associations with flow. These relationships held even when the positive relationship of species density was statistically removed from assemblage morphological patterns. Findings suggest that both species niche compression and assemblage niche space increase when habitat complexity is greater and flow velocity is lower in this tropical lowland river.
Next-generation sequencing of reduced-representation genomic libraries provides a powerful methodology for genotyping thousands of single-nucleotide polymorphisms (SNPs) among individuals of nonmodel species. Utilizing genotype data in the absence of a reference genome, however, presents a number of challenges. One major challenge is the trade-off between splitting alleles at a single locus into separate clusters (loci), creating inflated homozygosity, and lumping multiple loci into a single contig (locus), creating artefacts and inflated heterozygosity. This issue has been addressed primarily through the use of similarity cut-offs in sequence clustering. Here, two commonly employed, postclustering filtering methods (read depth and excess heterozygosity) used to identify incorrectly assembled loci are compared with haplotyping, another postclustering filtering approach. Simulated and empirical data sets were used to demonstrate that each of the three methods separately identified incorrectly assembled loci; more optimal results were achieved when the three methods were applied in combination. The results confirmed that including incorrectly assembled loci in population-genetic data sets inflates estimates of heterozygosity and deflates estimates of population divergence. Additionally, at low levels of population divergence, physical linkage between SNPs within a locus created artificial clustering in analyses that assume markers are independent. Haplotyping SNPs within a locus effectively neutralized the physical linkage issue without having to thin data to a single SNP per locus. We introduce a Perl script that haplotypes polymorphisms, using data from single or paired-end reads, and identifies potentially problematic loci.
For protected species under management, ensuring that genetic and phenotypic diversity is maintained above critical levels is one of the primary goals of conservation (Funk, McKay, Hohenlohe, &
BackgroundIntrogression likely plays a significant role in evolution, but understanding the extent and consequences of this process requires a clear identification of species boundaries in each focal group. The delimitation of species, however, is a contentious endeavor. This is true not only because of the inadequacy of current tools to identify species lineages, but also because of the inherent ambiguity between natural populations and species paradigms. The result has been a debate about the supremacy of various species concepts and criteria. Here, we utilized multiple separate sources of molecular data, mtDNA, nuclear sequences, and microsatellites, to delimit species under a polytypic species concept (PTSC) and estimate the frequency and genomic extent of introgression in a Neotropical genus of cichlid fishes (Cichla). We compared our inferences of species boundaries and introgression under this paradigm to those when species are identified under a diagnostic species concept (DSC).ResultsWe find that, based on extensive molecular data and an inclusive species concept, 8 separate biological entities should be recognized rather than the 15 described species of Cichla. Under the PTSC, fewer individuals are expected to exhibit hybrid ancestry than under the DSC (~2% vs. ~12%), but a similar number of the species exhibit introgression from at least one other species (75% vs. 60%). Under either species concept, the phylogenetic breadth of introgression in this group is notable, with both sister species and species from different major mtDNA clades exhibiting introgression.ConclusionsIntrogression was observed to be a widespread phenomenon for delimited species in this group. While several instances of introgressive hybridization were observed in anthropogenically altered habitats, most were found in undisturbed natural habitats, suggesting that introgression is a natural but ephemeral part of the evolution of many tropical species. Nevertheless, even transient introgression may facilitate an increase in genetic diversity or transfer of adaptive mutations that have important consequences in the evolution of tropical biodiversity.
Conserving life‐history variation is a stated goal of many management programs, but the most effective means by which to accomplish this are often far from clear. Early‐ and late‐migrating forms of Chinook salmon (Oncorhynchus tshawytscha) face unequal pressure from natural and anthropogenic forces that may alter the impacts of genetic variation underlying heritable migration timing. Genomic regions of chromosome 28 are known to be strongly associated with migration variation in adult Chinook salmon, but it remains unclear whether there is consistent association among diverse lineages and populations in large basins such as the Columbia River. With high‐throughput genotyping (GT‐seq) and phenotyping methods, we examined the association of genetic variation in 28 markers (spanning GREB1L to ROCK1 of chromosome 28) with individual adult migration timing characteristics gleaned from passive integrated transponder recordings of over 5000 Chinook salmon from the three major phylogeographic lineages that inhabit the Columbia River Basin. Despite the strong genetic differences among them in putatively neutral genomic regions, each of the three lineages exhibited very similar genetic variants in the chromosome 28 region that were significantly associated with adult migration timing phenotypes. This is particularly notable for the interior stream‐type lineage, which exhibits an earlier and more constrained freshwater entry than the other lineages. In both interior stream‐type and interior ocean‐type lineages of Chinook salmon, heterozygotes of the most strongly associated linkage groups had largely intermediate migration timing relative to homozygotes, and results indicate codominance or possibly marginal partial dominance of the allele associated with early migration. Our results lend support to utilization of chromosome 28 variation in tracking and predicting run timing in these lineages of Chinook salmon in the Columbia River.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.