There is a need to explore renewable alternatives (e.g., biofuels) that can produce energy sources to help reduce the reliance on fossil oils. In addition, the consumption of fossil oils adversely affects the environment and human health via the generation of waste water, greenhouse gases, and waste solids. Camelina sativa , originated from southeastern Europe and southwestern Asia, is being re-embraced as an industrial oilseed crop due to its high seed oil content (36–47%) and high unsaturated fatty acid composition (>90%), which are suitable for jet fuel, biodiesel, high-value lubricants and animal feed. C. sativa ’s agronomic advantages include short time to maturation, low water and nutrient requirements, adaptability to adverse environmental conditions and resistance to common pests and pathogens. These characteristics make it an ideal crop for sustainable agricultural systems and regions of marginal land. However, the lack of genetic and genomic resources has slowed the enhancement of this emerging oilseed crop and exploration of its full agronomic and breeding potential. Here, a core of 213 spring C. sativa accessions was collected and genotyped. The genotypic data was used to characterize genetic diversity and population structure to infer how natural selection and plant breeding may have affected the formation and differentiation within the C. sativa natural populations, and how the genetic diversity of this species can be used in future breeding efforts. A total of 6,192 high-quality single nucleotide polymorphisms (SNPs) were identified using genotyping-by-sequencing (GBS) technology. The average polymorphism information content (PIC) value of 0.29 indicate moderate genetic diversity for the C. sativa spring panel evaluated in this report. Population structure and principal coordinates analyses (PCoA) based on SNPs revealed two distinct subpopulations. Sub-population 1 (POP1) contains accessions that mainly originated from Germany while the majority of POP2 accessions (>75%) were collected from Eastern Europe. Analysis of molecular variance (AMOVA) identified 4% variance among and 96% variance within subpopulations, indicating a high gene exchange (or low genetic differentiation) between the two subpopulations. These findings provide important information for future allele/gene identification using genome-wide association studies (GWAS) and marker-assisted selection (MAS) to enhance genetic gain in C. sativa breeding programs.
Complexes of diploid and polyploid species have formed frequently during the evolution of land plants. In false flax (Camelina sativa), an important hexaploid oilseed crop closely related to Arabidopsis (Arabidopsis thaliana), the putative parental species as well as the origin of other Camelina species remained unknown. By using bacterial artificial chromosome-based chromosome painting, genomic in situ hybridization, and multi-gene phylogenetics, we aimed to elucidate the origin and evolution of the polyploid complex. Genomes of diploid camelinas (Camelina hispida, n 5 7; Camelina laxa, n 5 6; and Camelina neglecta, n 5 6) originated from an ancestral n 5 7 genome. The allotetraploid genome of Camelina rumelica (n 5 13, N 6 H) arose from hybridization between diploids related to C. neglecta (n 5 6, N 6 ) and C. hispida (n 5 7, H), and the N subgenome has undergone a substantial post-polyploid fractionation. The allohexaploid genomes of C. sativa and Camelina microcarpa (n 5 20, N 6 N 7 H) originated through hybridization between an auto-allotetraploid C. neglecta-like genome (n 5 13, N 6 N 7 ) and C. hispida (n 5 7, H), and the three subgenomes have remained stable overall since the genome merger. Remarkably, the ancestral and diploid Camelina genomes were shaped by complex chromosomal rearrangements, resembling those associated with human disorders and resulting in the origin of genome-specific shattered chromosomes.
The genus Camelina (Brassicaceae) comprises 7–8 diploid, tetraploid, and hexaploid species. Of particular agricultural interest is the biofuel crop, C. sativa (gold-of-pleasure or false flax), an allohexaploid domesticated from the widespread weed, C. microcarpa. Recent cytogenetics and genomics work has uncovered the identity of the parental diploid species involved in ancient polyploidization events in Camelina. However, little is known about the maternal subgenome ancestry of contemporary polyploid species. To determine the diploid maternal contributors of polyploid Camelina lineages, we sequenced and assembled 84 Camelina chloroplast genomes for phylogenetic analysis. Divergence time estimation was used to infer the timing of polyploidization events. Chromosome counts were also determined for 82 individuals to assess ploidy and cytotypic variation. Chloroplast genomes showed minimal divergence across the genus, with no observed gene-loss or structural variation. Phylogenetic analyses revealed C. hispida as a maternal diploid parent to the allotetraploid Camelina rumelica, and C. neglecta as the closest extant diploid contributor to the allohexaploids C. microcarpa and C. sativa. The tetraploid C. rumelica appears to have evolved through multiple independent hybridization events. Divergence times for polyploid lineages closely related to C. sativa were all inferred to be very recent, at only ~65 thousand years ago. Chromosome counts confirm that there are two distinct cytotypes within C. microcarpa (2n = 38 and 2n = 40). Based on these findings and other recent research, we propose a model of Camelina subgenome relationships representing our current understanding of the hybridization and polyploidization history of this recently-diverged genus.
Camelina neglecta is described as a new diploid species and its relationship to the other diploids of the genus and to the somewhat superficially similar tetraploid C. rumelica and hexaploid C. microcarpa, are discussed. SEM of seed and stem trichomes of the new species are presented.
Long intergenic noncoding RNAs (lincRNAs) are a large yet enigmatic class of eukaryotic transcripts that can have critical biological functions. The wealth of RNA-sequencing (RNA-seq) data available for plants provides the opportunity to implement a harmonized identification and annotation effort for lincRNAs that enables cross-species functional and genomic comparisons as well as prioritization of functional candidates. In this study, we processed >24 Tbp of RNA-seq data from >16,000 experiments to identify ∼130,000 lincRNAs in four Brassicaceae: Arabidopsis thaliana, Camelina sativa, Brassica rapa, and Eutrema salsugineum. We used Nanopore RNA-seq, transcriptome-wide structural information, peptide data, and epigenomic data to characterize these lincRNAs and identify conserved motifs. We then used comparative genomic and transcriptomic approaches to highlight lincRNAs in our dataset with sequence or transcriptional conservation. Finally, we used guilt-by-association analyses to assign putative functions to lincRNAs within our dataset. We tested this approach on a subset of lincRNAs associated with germination and seed development, observing germination defects for Arabidopsis lines harboring T-DNA insertions at these loci. LincRNAs with Brassicaceae-conserved putative miRNA binding motifs, small open reading frames, or abiotic-stress modulated expression are a few of the annotations that will guide functional analyses into this cryptic portion of the transcriptome.
Background Camelina sativa (gold-of-pleasure) is a traditional European oilseed crop and emerging biofuel source with high levels of desirable fatty acids. A twentieth century germplasm bottleneck depleted genetic diversity in the crop, leading to recent interest in using wild relatives for crop improvement. However, little is known about seed oil content and genetic diversity in wild Camelina species. Results We used gas chromatography, environmental niche assessment, and genotyping-by-sequencing to assess seed fatty acid composition, environmental distributions, and population structure in C. sativa and four congeners, with a primary focus on the crop’s wild progenitor, C. microcarpa. Fatty acid composition differed significantly between Camelina species, which occur in largely non-overlapping environments. The crop progenitor comprises three genetic subpopulations with discrete fatty acid compositions. Environment, subpopulation, and population-by-environment interactions were all important predictors for seed oil in these wild populations. A complementary growth chamber experiment using C. sativa confirmed that growing conditions can dramatically affect both oil quantity and fatty acid composition in Camelina. Conclusions Genetics, environmental conditions, and genotype-by-environment interactions all contribute to fatty acid variation in Camelina species. These insights suggest careful breeding may overcome the unfavorable FA compositions in oilseed crops that are predicted with warming climates.
Long intergenic noncoding RNAs (lincRNAs) are a large yet enigmatic class of eukaryotic transcripts with critical biological functions. Despite the wealth of RNA-seq data available, lincRNA identification lags in the plant lineage. In addition, there is a need for a harmonized identification and annotation effort to enable cross-species functional and genomic comparisons. In this study we processed >24 Tbp of RNA-seq data from >16,000 experiments to identify ~130,000 lincRNAs in four Brassicaceae: Arabidopsis thaliana, Camelina sativa, Brassica rapa, and Eutrema salsugineum. We used Nanopore RNA-seq, transcriptome-wide structural information, peptide data, and epigenomic data to characterize these lincRNAs and identify functional motifs. We then used comparative genomic and transcriptomic approaches to highlight lincRNAs in our dataset with sequence or transcriptional evolutionary conservation, including lincRNAs transcribed adjacent to orthologous genes that display little sequence similarity and likely function as transcriptional regulators. Finally, we used guilt-by-association techniques to further classify these lincRNAs according to putative function. LincRNAs with Brassicaceae-conserved putative miRNA binding motifs, short ORFs, and whose expression is modulated by abiotic stress are a few of the annotations that will prioritize and guide future functional analyses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.