Aquaculture is the fastest growing farmed food sector and will soon become the primary source of fish and shellfish for human diets. In contrast to crops and livestock, production is derived from numerous, exceptionally diverse species that are typically in the early stages of domestication. Genetic improvement of production traits via well-designed, managed breeding programmes has great potential to help meet the rising seafood demand driven by human population growth. Supported by continuous advances in sequencing and bioinformatics, genomics is increasingly being applied across the broad range of aquaculture species and at all stages of the domestication process to optimize selective breeding. In the future, combining genomic selection with biotechnological innovations, such as genome editing and surrogate broodstock technologies, may further expedite genetic improvement in aquaculture.
BackgroundThe functional divergence of duplicate genes (ohnologues) retained from whole genome duplication (WGD) is thought to promote evolutionary diversification. However, species radiation and phenotypic diversification are often temporally separated from WGD. Salmonid fish, whose ancestor underwent WGD by autotetraploidization ~95 million years ago, fit such a ‘time-lag’ model of post-WGD radiation, which occurred alongside a major delay in the rediploidization process. Here we propose a model, ‘lineage-specific ohnologue resolution’ (LORe), to address the consequences of delayed rediploidization. Under LORe, speciation precedes rediploidization, allowing independent ohnologue divergence in sister lineages sharing an ancestral WGD event.ResultsUsing cross-species sequence capture, phylogenomics and genome-wide analyses of ohnologue expression divergence, we demonstrate the major impact of LORe on salmonid evolution. One-quarter of each salmonid genome, harbouring at least 4550 ohnologues, has evolved under LORe, with rediploidization and functional divergence occurring on multiple independent occasions >50 million years post-WGD. We demonstrate the existence and regulatory divergence of many LORe ohnologues with functions in lineage-specific physiological adaptations that potentially facilitated salmonid species radiation. We show that LORe ohnologues are enriched for different functions than ‘older’ ohnologues that began diverging in the salmonid ancestor.ConclusionsLORe has unappreciated significance as a nested component of post-WGD divergence that impacts the functional properties of genes, whilst providing ohnologues available solely for lineage-specific adaptation. Under LORe, which is predicted following many WGD events, the functional outcomes of WGD need not appear ‘explosively’, but can arise gradually over tens of millions of years, promoting lineage-specific diversification regimes under prevailing ecological pressures.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-017-1241-z) contains supplementary material, which is available to authorized users.
Structural variants (SVs) are a major source of genetic and phenotypic variation, but remain challenging to accurately type and are hence poorly characterized in most species. We present an approach for reliable SV discovery in non-model species using whole genome sequencing and report 15,483 high-confidence SVs in 492 Atlantic salmon (Salmo salar L.) sampled from a broad phylogeographic distribution. These SVs recover population genetic structure with high resolution, include an active DNA transposon, widely affect functional features, and overlap more duplicated genes retained from an ancestral salmonid autotetraploidization event than expected. Changes in SV allele frequency between wild and farmed fish indicate polygenic selection on behavioural traits during domestication, targeting brain-expressed synaptic networks linked to neurological disorders in humans. This study offers novel insights into the role of SVs in genome evolution and the genetic architecture of domestication traits, along with resources supporting reliable SV discovery in non-model species.
Background Whole genome duplication (WGD) events have played a major role in eukaryotic genome evolution, but the consequence of these extreme events in adaptive genome evolution is still not well understood. To address this knowledge gap, we used a comparative phylogenetic model and transcriptomic data from seven species to infer selection on gene expression in duplicated genes (ohnologs) following the salmonid WGD 80–100 million years ago. Results We find rare cases of tissue-specific expression evolution but pervasive expression evolution affecting many tissues, reflecting strong selection on maintenance of genome stability following genome doubling. Ohnolog expression levels have evolved mostly asymmetrically, by diverting one ohnolog copy down a path towards lower expression and possible pseudogenization. Loss of expression in one ohnolog is significantly associated with transposable element insertions in promoters and likely driven by selection on gene dosage including selection on stoichiometric balance. We also find symmetric expression shifts, and these are associated with genes under strong evolutionary constraints such as ribosome subunit genes. This possibly reflects selection operating to achieve a gene dose reduction while avoiding accumulation of “toxic mutations”. Mechanistically, ohnolog regulatory divergence is dictated by the number of bound transcription factors in promoters, with transposable elements being one likely source of novel binding sites driving tissue-specific gains in expression. Conclusions Our results imply pervasive adaptive expression evolution following WGD to overcome the immediate challenges posed by genome doubling and to exploit the long-term genetic opportunities for novel phenotype evolution.
The long-term evolutionary impacts of whole-genome duplication (WGD) are strongly influenced by the ensuing rediploidization process. Following autopolyploidization, rediploidization involves a transition from tetraploid to diploid meiotic pairing, allowing duplicated genes (ohnologs) to diverge genetically and functionally. Our understanding of autopolyploid rediploidization has been informed by a WGD event ancestral to salmonid fishes, where large genomic regions are characterized by temporally delayed rediploidization, allowing lineage-specific ohnolog sequence divergence in the major salmonid clades. Here, we investigate the long-term outcomes of autopolyploid rediploidization at genomewide resolution, exploiting a recent "explosion" of salmonid genome assemblies, including a new genome sequence for the huchen (Hucho hucho). We developed a genome alignment approach to capture duplicated regions across multiple species, allowing us to create 121,864 phylogenetic trees describing genome-wide ohnolog divergence across salmonid evolution. Using molecular clock analysis, we show that 61% of the ancestral salmonid genome experienced an initial "wave" of rediploidization in the late Cretaceous (85-106 Ma). This was followed by a period of relative genomic stasis lasting 17-39 My, where much of the genome remained tetraploid. A second rediploidization wave began in the early Eocene and proceeded alongside species diversification, generating predictable patterns of lineage-specific ohnolog divergence, scaling in complexity with the number of speciation events. Using gene set enrichment, gene expression, and codon-based selection analyses, we provide insights into potential functional outcomes of delayed rediploidization. This study enhances our understanding of delayed autopolyploid rediploidization and has broad implications for future studies of WGD events.
1Structural variants (SVs) are a major source of genetic and phenotypic variation, but remain challenging to 2 accurately type and are hence poorly characterized in most species. We present an approach for reliable SV 3 discovery in non-model species using whole genome sequencing and report 15,483 high-confidence SVs in 4 492 Atlantic salmon (Salmo salar L.) sampled from a broad phylogeographic distribution. These SVs 5 recover population genetic structure with high resolution, include an active DNA transposon, widely affect 6 functional features, and overlap more duplicated genes retained from an ancestral salmonid 7 autotetraploidization event than expected. Changes in SV allele frequency between wild and farmed fish 8 indicate polygenic selection on behavioural traits during domestication, targeting brain-expressed synaptic 9 networks linked to neurological disorders in humans. This study offers novel insights into the role of SVs 10 in genome evolution and the genetic architecture of domestication traits, along with resources supporting 11 reliable SV discovery in non-model species. 12Main 13Modern genetics remains primarily focused on single nucleotide polymorphism (SNP) analyses, with a 14 growing recognition of the importance of larger structural variants (SVs) including inversions, insertions, 15 deletions and copy number variations (CNVs) (defined here as variants ≥100 bp), among others 1 . SVs 16 affect a larger proportion of bases in human genomes than SNPs 4 , are not always reliably tagged by SNPs 5 , 17 more frequently have regulatory impacts 6 , and have been shown to alter the structure, presence, number, 18 dosage, and regulation of many genes 1 . Nonetheless, SVs remain challenging to accurately type using 19 whole genome sequence data 2-3 , limiting our understanding of their biological roles and exploitation as 20 genetic markers. Consequently, there is a need for reliable SV detection approaches to fully exploit the fast-21 accumulating genome sequencing datasets in both model and non-model species, allowing for more 22 complete genetics investigations. Many tools exist for SV discovery using short-read sequencing data, but 23 all suffer from high false discovery rates (10-89%) 2,3,7 . This poses a challenge for truly de novo SV 24 detection in previously unstudied species lacking 'gold standard' reference SVs to help distinguish true 25 from false calls. Most studies rely on combining an ensemble of signals from different SV detection 26 methods, although this strategy does not reliably improve performance and can in some cases aggravate 27 false discovery 3 . Researchers therefore often apply independent experimental 8-9 or visualization methods 10 28 to validate a subset of SV calls. Overall, there remains an unsatisfactory lack of consensus on how to 29 validate the quality of de novo SV datasets in most species 3 . 31Salmonids have the highest combined economic, ecological and scientific importance among all fish 32 lineages, and have consequently been subject to hundreds of genetics stu...
The functional divergence of duplicate genes (ohnologues) retained from whole genome duplication (WGD) is thought to promote evolutionary diversification. However, species radiation and phenotypic diversification is often highly temporally-detached from WGD. Salmonid fish, whose ancestor experienced WGD by autotetraploidization ~95 Ma (i.e. ‘Ss4R’), fit such a ‘time-lag’ model of post-WGD radiation, which occurred alongside a major delay in the rediploidization process. Here we propose a model called ‘Lineage-specific Ohnologue Resolution’ (LORe) to address the phylogenetic and functional consequences of delayed rediploidization. Under LORe, speciation precedes rediploidization, allowing independent ohnologue divergence in sister lineages sharing an ancestral WGD event. Using cross-species sequence capture, phylogenomics and genome-wide analyses of ohnologue expression divergence, we demonstrate the major impact of LORe on salmonid evolution. One quarter of each salmonid genome, harbouring at least 4,500 ohnologues, has evolved under LORe, with rediploidization and functional divergence occurring on multiple independent occasions > 50 Myr post-WGD. We demonstrate the existence and regulatory divergence of many LORe ohnologues with functions in lineage-specific physiological adaptations that promoted salmonid species radiation. We show that LORe ohnologues are enriched for different functions than ‘older’ ohnologues that began diverging in the salmonid ancestor. LORe has unappreciated significance as a nested component of post-WGD divergence that impacts the functional properties of genes, whilst providing ohnologues available solely for lineage-specific adaptation. Under LORe, which is predicted following many WGD events, the functional outcomes of WGD need not appear ‘explosively’, but can arise gradually over tens of Myr, promoting lineage-specific diversification regimes under prevailing ecological pressures.
Background Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects. Results We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis -regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication. Conclusions SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB . Electronic supplementary material The online version of this article (10.1186/s12864-019-6051-0) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.