Common carp (Cyprinus carpio) is an allotetraploid species derived from recent whole genome duplication and provides a model to study polyploid genome evolution in vertebrates. Here, we generate three chromosome-level reference genomes of C. carpio and compare to related diploid Cyprinid genomes. We identify a Barbinae lineage as potential diploid progenitor of C. carpio and then divide the allotetraploid genome into two subgenomes marked by a distinct genome similarity to the diploid progenitor. We estimate that the two diploid progenitors diverged around 23 Mya and merged around 12.4 Mya based on the divergence rates of homoeologous genes and transposable elements in two subgenomes. No extensive gene losses are observed in either subgenome. Instead, we find gene expression bias across surveyed tissues such that subgenome B is more dominant in homoeologous expression. CG methylation in promoter regions may play an important role in altering gene expression in allotetraploid C. carpio.
High density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed high level of conserved syntenies between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly.
Common carp (Cyprinus carpio) is an allotetraploid Cyprinid species derived from recent whole genome duplication and provides an excellent model system for studying polyploid genome evolution in vertebrates. To explore the origins and consequences of tetraploidy in C. carpio, we generated three chromosome-level new reference genomes of C. carpio and compared them to the related diploid Cyprinid genome sequences. We identified a progenitor-like diploid Barbinae lineage by analysing the phylogenetic relationship of the homoeologous genes of C. carpio and their orthologues in closely related diploid Cyprinids. We then characterized the allotetraploid origin of C. carpio and divided its genome into two homoeologous subgenomes that are marked by a distinct genome similarity to their diploid progenitor. On the basis of the divergence rates of homoeologous genes and transposable elements in two subgenomes, we estimated that the two diploid progenitor species diverged approximately 23 million years ago (Mya) and merged to form the allotetraploid C. carpio approximately 12.4 Mya, which likely correlated with environmental upheavals caused by the extensive uplift of the Qinghai-Tibetan Plateau. No large-scale gene losses or rediploidization were observed in the two subgenomes. Instead, we found extensive homoeologous gene expression bias across twelve surveyed tissues, which indicates that subgenome B is dominant in homoeologous expression. DNA methylation analysis suggested that CG methylation in promoter regions plays an important role in altering the expression of these homoeologous genes in allotetraploid C. carpio. This study provides an essential genome resource and insights for extending further investigation on the evolutionary consequences of vertebrate polyploidy.
The Amur ide (Leuciscus waleckii) is a cyprinid fish that is widely distributed in Northeast Asia. The Lake Dali Nur population inhabits one of the most extreme aquatic environments on Earth, with an alkalinity up to 50 mmol/L (pH 9.6), thus providing an exceptional model with which to characterize the mechanisms of genomic evolution underlying adaptation to extreme environments. Here, we developed the reference genome assembly for L. waleckii from Lake Dali Nur. Intriguingly, we identified unusual expanded long terminal repeats (LTRs) with higher nucleotide substitution rates than in many other teleosts, suggesting their more recent insertion into the L. waleckii genome. We also identified expansions in genes encoding egg coat proteins and natriuretic peptide receptors, possibly underlying the adaptation to extreme environmental stress. We further sequenced the genomes of 10 additional individuals from freshwater and 18 from Lake Dali Nur populations, and we detected a total of 7.6 million SNPs from both populations. In a genome scan and comparison of these two populations, we identified a set of genomic regions under selective sweeps that harbor genes involved in ion homoeostasis, acid-base regulation, unfolded protein response, reactive oxygen species elimination, and urea excretion. Our findings provide comprehensive insight into the genomic mechanisms of teleost fish that underlie their adaptation to extreme alkaline environments.
BackgroundAmur ide (Leuciscus waleckii) is an economically and ecologically important cyprinid species in Northern Asia. The Dali Nor population living in the soda lake Dali Nor can adapt the extremely high alkalinity, providing us a valuable material to understand the adaptation mechanism against extreme environmental stress in teleost.ResultsIn this study, we generated high-throughput RNA-Seq data from three tissues gill, liver and kidney of L. waleckii living in the soda lake Dali Nor and the fresh water lake Ganggeng Nor, then performed parallel comparisons of three tissues. Our results showed that out of assembled 64,603 transcript contigs, 28,391 contigs had been assigned with a known function, corresponding to 20,371 unique protein accessions. We found 477, 2,761 and 3,376 differentially expressed genes (DEGs) in the gill, kidney, and liver, respectively, of Dali Nor population compared to Ganggeng Nor population with FDR ≤ 0.01and fold-change ≥ 2. Further analysis revealed that well-known functional categories of genes and signaling pathway, which are associated with stress response and extreme environment adaptation, have been significantly enriched, including the functional categories of “response to stimulus”, “transferase activity”, “transporter activity” and “oxidoreductase activity”, and signaling pathways of “mTOR signaling”, “EIF2 signaling”, “superpathway of cholesterol biosynthesis”. We also identified significantly DEGs encoding important modulators on stress adaptation and tolerance, including carbonic anhydrases, heat shock proteins, superoxide dismutase, glutathione S-transferases, aminopeptidase N, and aminotransferases.ConclusionsOverall, this study demonstrated that transcriptome changes in L. waleckii played a role in adaptation to complicated environmental stress in the highly alkalized Dali Nor lake. The results set a foundation for further analyses on alkaline-responsive candidate genes, which help us understand teleost adaptation under extreme environmental stress and ultimately benefit future breeding for alkaline-tolerant fish strains.
SummaryCommunity ecology theory suggests that an individual's phenotype is determined by the phenotypes of its coexisting members to the extent at which this process can shape community evolution. Here, we develop a mapping theory to identify interaction quantitative trait loci (QTL) governing inter-individual dependence. We mathematically formulate the decision-making strategy of interacting individuals. We integrate these mathematical descriptors into a statistical procedure, enabling the joint characterization of how QTL drive the strengths of ecological interactions and how the genetic architecture of QTL is driven by ecological networks. In three fish full-sib mapping experiments, we identify a set of genome-wide QTL that control a range of societal behaviors, including mutualism, altruism, aggression, and antagonism, and find that these intraspecific interactions increase the genetic variation of body mass by about 50%. We showcase how the interaction QTL can be used as editors to reconstruct and engineer new social networks for ecological communities.
Polyunsaturated fatty acids (PUFAs) are a set of important nutrients that mainly include arachidonic acid (ARA4), docosahexaenoic acid (DHA), eicosapentaenoic acid (EPA), and α-linolenic acid (ALA). Recently, fish-derived PUFAs have been associated with cardiovascular health, fetal development, and improvement of brain functions. Studies have shown that fish muscular tissues are rich in PUFAs, which are influenced by various factors, including genetic variations, regulatory profiles, and methylation status of desaturase genes during fatty acid desaturation and elongation processes. However, the genetic mechanism and the pathways involved in fatty acid metabolism in fishes remain unclear. The overall aim of this study was to assess differences in gene expression responses among fishes with different fatty acid levels. To achieve this goal, we conducted genome-wide association analysis (GWAS) using a 250K SNP array in a population of 203 samples of common carp (Cyprinus carpio) and identified nine SNPs and 15 genes associated with muscular PUFA content. Then, RNA-Seq and whole genome bisulfite sequencing (WGBS) of different groups with high and low EPA, DHA, ARA4, and ALA contents in muscle, liver and brain tissues were conducted, resulting in 6,750 differentially expressed genes and 5,631 genes with differentially methylated promoters. Gene ontology and KEGG pathway enrichment analyses of RNA-Seq and WGBS results identified enriched pathways for fatty acid metabolism, which included the adipocytokine signaling pathway, ARA4 and linoleic acid metabolism pathway, and insulin signaling pathway. Integrated analysis indicated significant correlations between gene expression and methylation status among groups with high and low PUFA contents in muscular tissues. Taken together, these multi-level results uncovered candidate genes and pathways that are associated with fatty acid metabolism and paved the way for further genomic selection and carp breeding for PUFA traits.
Background: The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene. Conclusion: We generated a high-quality draft genome of C. argus, which will provide a valuable genetic resource for further biomedical investigations of this economically important teleost fish.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.