Advancing the production efficiency and profitability of aquaculture is dependent upon the ability to utilize a diverse array of genetic resources. The ultimate goals of aquaculture genomics, genetics and breeding research are to enhance aquaculture production efficiency, sustainability, product quality, and profitability in support of the commercial sector and for the benefit of consumers. In order to achieve these goals, it is important to understand the genomic structure and organization of aquaculture species, and their genomic and phenomic variations, as well as the genetic basis of traits and their interrelationships. In addition, it is also important to understand the mechanisms of regulation and evolutionary conservation at the levels of genome, transcriptome, proteome, epigenome, and systems biology. With genomic information and information between the genomes and phenomes, technologies for marker/causal mutation-assisted selection, genome selection, and genome editing can be developed for applications in aquaculture. A set of genomic tools and resources must be made available including reference genome sequences and their annotations (including coding and non-coding regulatory elements), genome-wide polymorphic markers, efficient genotyping platforms, high-density and high-resolution linkage maps, and transcriptome resources including non-coding transcripts. Genomic and genetic control of important performance and production traits, such as disease resistance, feed conversion efficiency, growth rate, processing yield, behaviour, reproductive characteristics, and tolerance to environmental stressors like low dissolved oxygen, high or low water temperature and salinity, must be understood. QTL need to be identified, validated across strains, lines and populations, and their mechanisms of control understood. Causal gene(s) need to be identified. Genetic and epigenetic regulation of important aquaculture traits need to be determined, and technologies for marker-assisted selection, causal gene/mutation-assisted selection, genome selection, and genome editing using CRISPR and other technologies must be developed, demonstrated with applicability, and application to aquaculture industries.Major progress has been made in aquaculture genomics for dozens of fish and shellfish species including the development of genetic linkage maps, physical maps, microarrays, single nucleotide polymorphism (SNP) arrays, transcriptome databases and various stages of genome reference sequences. This paper provides a general review of the current status, challenges and future research needs of aquaculture genomics, genetics, and breeding, with a focus on major aquaculture species in the United States: catfish, rainbow trout, Atlantic salmon, tilapia, striped bass, oysters, and shrimp. While the overall research priorities and the practical goals are similar across various aquaculture species, the current status in each species should dictate the next priority areas within the species. This paper is an output of the USDA Workshop fo...
Detection of coding/functional SNPs that change the biological function of a gene may lead to identification of putative causative alleles within QTL regions and discovery of genetic markers with large effects on phenotypes. This study has two-fold objectives, first to develop, and validate a 50K transcribed gene SNP-chip using RNA-Seq data. To achieve this objective, two bioinformatics pipelines, GATK and SAMtools, were used to identify ~21K transcribed SNPs with allelic imbalances associated with important aquaculture production traits including body weight, muscle yield, muscle fat content, shear force, and whiteness in addition to resistance/susceptibility to bacterial cold-water disease (BCWD). SNPs ere identified from pooled RNA-Seq data collected from ~620 fish, representing 98 families from growth- and 54 families from BCWD-selected lines with divergent phenotypes. In addition, ~29K transcribed SNPs without allelic-imbalances were strategically added to build a 50K Affymetrix SNP-chip. SNPs selected included two SNPs per gene from 14K genes and ~5K non-synonymous SNPs. The SNP-chip was used to genotype 1728 fish. The average SNP calling-rate for samples passing quality control (QC; 1,641 fish) was ≥ 98.5%. The second objective of this study was to test the feasibility of using the new SNP-chip in GWA (Genome-wide association) analysis to identify QTL explaining muscle yield variance. GWA study on 878 fish (representing 197 families from 2 consecutive generations) with muscle yield phenotypes and genotyped for 35K polymorphic markers (passing QC) identified several QTL regions explaining together up to 28.40% of the additive genetic variance for muscle yield in this rainbow trout population. The most significant QTLs were on chromosomes 14 and 16 with 12.71 and 10.49% of the genetic variance, respectively. Many of the annotated genes in the QTL regions were previously reported as important regulators of muscle development and cell signaling. No major QTLs were identified in a previous GWA study using a 57K genomic SNP chip on the same fish population. These results indicate improved detection power of the transcribed gene SNP-chip in the target trait and population, allowing identification of large-effect QTLs for important traits in rainbow trout.
The ENCODE project revealed that ~70% of the human genome is transcribed. While only 1–2% of the RNAs encode for proteins, the rest are non-coding RNAs. Long non-coding RNAs (lncRNAs) form a diverse class of non-coding RNAs that are longer than 200nt. Emerging evidence indicates that lncRNAs play critical roles in various cellular processes including regulation of gene expression. LncRNAs show low levels of gene expression and sequence conservation, which make their computational identification in genomes difficult. In this study, more than two billion Illumina sequence reads were mapped to the genome reference using the TopHat and Cufflinks software. Transcripts shorter than 200nt, with more than 83–100 amino acids ORF, or with significant homologies to the NCBI nr-protein database were removed. In addition, a computational pipeline was used to filter the remaining transcripts based on a protein-coding-score test. Depending on the filtering stringency conditions, between 31,195 and 54,503 lncRNAs were identified, with only 421 matching known lncRNAs in other species. A digital gene expression atlas revealed 2,935 tissue-specific and 3,269 ubiquitously-expressed lncRNAs. This study annotates the lncRNA rainbow trout genome and provides a valuable resource for functional genomics research in salmonids.
Filet quality traits determine consumer satisfaction and affect profitability of the aquaculture industry. Soft flesh is a criterion for fish filet downgrades, resulting in loss of value. Filet firmness is influenced by many factors, including rate of protein turnover. A 50K transcribed gene SNP chip was used to genotype 789 rainbow trout, from two consecutive generations, produced in the USDA/NCCCWA selective breeding program. Weighted single-step GBLUP (WssGBLUP) was used to perform genome-wide association (GWA) analyses to identify quantitative trait loci affecting filet firmness and protein content. Applying genomic sliding windows of 50 adjacent SNPs, 212 and 225 SNPs were associated with genetic variation in filet shear force and protein content, respectively. Four common SNPs in the ryanodine receptor 3 gene (RYR3) affected the aforementioned filet traits; this association suggests common mechanisms underlying filet shear force and protein content. Genes harboring SNPs were mostly involved in calcium homeostasis, proteolytic activities, transcriptional regulation, chromatin remodeling, and apoptotic processes. RYR3 harbored the highest number of SNPs ( n = 32) affecting genetic variation in shear force (2.29%) and protein content (4.97%). Additionally, based on single-marker analysis, a SNP in RYR3 ranked at the top of all SNPs associated with variation in shear force. Our data suggest a role for RYR3 in muscle firmness that may be considered for genomic- and marker-assisted selection in breeding programs of rainbow trout.
BackgroundCoding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait).ResultsGATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7–93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways.ConclusionThese results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3992-z) contains supplementary material, which is available to authorized users.
In fish, protein-coding and noncoding genes involved in muscle atrophy are not fully characterized. In this study, we characterized coding and noncoding genes involved in gonadogenesis-associated muscle atrophy, and investigated the potential functional interplay between these genes. Using RNA-Seq, we compared expression pattern of mRNAs, long noncoding RNAs (lncRNAs) and microRNAs of atrophying skeletal muscle from gravid females and control skeletal muscle from age-matched sterile individuals. A total of 852 mRNAs, 1,160 lncRNAs and 28 microRNAs were differentially expressed (DE) between the two groups. Muscle atrophy appears to be mediated by many genes encoding ubiquitin-proteasome system, autophagy related proteases, lysosomal proteases and transcription factors. Transcripts encoding atrogin-1 and mir-29 showed exceptional high expression in atrophying muscle, suggesting an important role in bulk muscle proteolysis. DE genes were co-localized in the genome with strong expression correlation, and they exhibited extensive ‘lncRNA-mRNA’, ‘lncRNA-microRNA’, ‘mRNA-microRNA’ and ‘lncRNA-protein’ physical interactions. DE genes exhibiting potential functional interactions comprised the highly correlated ‘lncRNA-mRNA-microRNA’ gene network described as ‘degradome’. This study pinpoints extensive coding and noncoding RNA interactions during muscle atrophy in fish, and provides valuable resources for future mechanistic studies.
Muscle yield and quality traits are important for the aquaculture industry and consumers. Genetic selection for these traits is difficult because they are polygenic and result from multifactorial interactions. To study the genetic architecture of these traits, phenotypic characterization of whole body weight (WBW), muscle yield, fat content, shear force and whiteness were measured in ~500 fish representing 98 families from a growth-selected line. RNA-Seq was used to sequence the muscle transcriptome of different families exhibiting divergent phenotypes for each trait. We have identified 240 and 1,280 differentially expressed (DE) protein-coding genes and long noncoding RNAs (lncRNAs), respectively, in fish families exhibiting contrasting phenotypes. Expression of many DE lncRNAs (n = 229) was positively correlated with overlapping, neighboring or distantly located protein-coding genes (n = 1,030), resulting in 3,392 interactions. Three DE antisense lncRNAs were co-expressed with sense genes known to impact muscle quality traits. Forty-four DE lncRNAs had potential sponge functions to miRNAs that affect muscle quality traits. This study (1) defines muscle quality associated protein-coding and noncoding genes and (2) provides insight into non-coding RNAs involvement in regulating growth and fillet quality traits in rainbow trout.
Bacterial cold-water disease caused by Flavobacterium psychrophilum is one of the major causes of mortality of salmonids. Three genetic lines of rainbow trout designated as ARS-Fp-R (resistant), ARS-Fp-C (control) and ARS-Fp-S (susceptible) have significant differences in survival rate following F. psychrophilum infection. Previous study identified transcriptome differences of immune-relevant protein-coding genes at basal and post infection levels among these genetic lines. Using RNA-Seq approach, we quantified differentially expressed (DE) long non-coding RNAs (lncRNAs) in response to F. psychrophilum challenge in these genetic lines. Pairwise comparison between genetic lines and different infection statuses identified 556 DE lncRNAs. A positive correlation existed between the number of the differentially regulated lncRNAs and that of the protein-coding genes. Several lncRNAs showed strong positive and negative expression correlation with their overlapped, neighboring and distant immune related protein-coding genes including complement components, cytokines, chemokines and several signaling molecules involved in immunity. The correlated expressions and genome-wide co-localization suggested that some lncRNAs may be involved in regulating immune-relevant protein-coding genes. This study provides the first evidence of lncRNA-mediated regulation of the anti-bacterial immune response in a commercially important aquaculture species and will likely help developing new genetic markers for rainbow trout disease resistance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.