Oil palm is the most productive oil-bearing crop. Planted on only 5% of the total vegetable oil acreage, palm oil accounts for 33% of vegetable oil, and 45% of edible oil worldwide, but increased cultivation competes with dwindling rainforest reserves. We report the 1.8 gigabase (Gb) genome sequence of the African oil palm Elaeis guineensis, the predominant source of worldwide oil production. 1.535 Gb of assembled sequence and transcriptome data from 30 tissue types were used to predict at least 34,802 genes, including oil biosynthesis genes and homologues of WRINKLED1 (WRI1), and other transcriptional regulators1, which are highly expressed in the kernel. We also report the draft sequence of the S. American oil palm Elaeis oleifera, which has the same number of chromosomes (2n=32) and produces fertile interspecific hybrids with E. guineensis2, but appears to have diverged in the new world. Segmental duplications of chromosome arms define the palaeotetraploid origin of palm trees. The oil palm sequence enables the discovery of genes for important traits as well as somaclonal epigenetic alterations which restrict the use of clones in commercial plantings3, and thus helps achieve sustainability for biofuels and edible oils, reducing the rainforest footprint of this tropical plantation crop.
Somaclonal variation arises in plants and animals when differentiated somatic cells are induced into a pluripotent state, but the resulting clones differ from each other and from their parents. In agriculture, somaclonal variation has hindered micropropagation of elite hybrids and genetically modified crops, but the mechanism remains a mystery1. The oil palm fruit abnormality, mantled, is a somaclonal variant arising from tissue culture that drastically reduces yield, and has largely halted efforts to clone elite hybrids for oil production2–4. Widely regarded as epigenetic5, mantling has defied explanation, but here we identify the MANTLED gene using Epigenome Wide Association Studies. DNA hypomethylation of a LINE retrotransposon related to rice Karma, in the intron of the homeotic gene DEFICIENS, is common to all mantled clones and is associated with alternative splicing and premature termination. Dense methylation near the Karma splice site (the Good Karma epiallele) predicts normal fruit set, while hypomethylation (the Bad Karma epiallele) predicts homeotic transformation, parthenocarpy and dramatic loss of yield. Loss of Karma methylation and small RNA in tissue culture contributes to the origin of mantled, while restoration in spontaneous revertants accounts for non-Mendelian inheritance. The ability to predict and cull mantling at the plantlet stage will facilitate the introduction of higher performing clones and optimize environmentally sensitive land resources.
A key event in the domestication and breeding of the oil palm, Elaeis guineensis, was loss of the thick coconut-like shell surrounding the kernel. Modern E. guineensis has three fruit forms, dura (thick-shelled), pisifera (shell-less) and tenera (thin-shelled), a hybrid between dura and pisifera1–4. The pisifera palm is usually female-sterile but the tenera yields far more oil than dura, and is the basis for commercial palm oil production in all of Southeast Asia5. Here, we describe the mapping and identification of the Shell gene responsible for the different fruit forms. Using homozygosity mapping by sequencing we found two independent mutations in the DNA binding domain of a homologue of the MADS-box gene SEEDSTICK (STK) which controls ovule identity and seed development in Arabidopsis. The Shell gene is responsible for the tenera phenotype in both cultivated and wild palms from sub-Saharan Africa, and our findings provide a genetic explanation for the single gene heterosis attributed to Shell, via heterodimerization. This gene mutation explains the single most important economic trait in oil palm, and has implications for the competing interests of global edible oil production, biofuels and rainforest conservation6.
Oil palm, a plantation crop of major economic importance in Southeast Asia, is the predominant source of edible oil worldwide. We report the identification of the VIRESCENS (VIR) gene, which controls fruit exocarp colour and is an indicator of ripeness. VIR is a R2R3-MYB transcription factor with homology to Lilium LhMYB12 and similarity to Arabidopsis PRODUCTION OF ANTHOCYANIN PIGMENT1 (PAP1). We identify five independent mutant alleles of VIR in over 400 accessions from sub-Saharan Africa that account for the dominant-negative virescens phenotype. Each mutation results in premature termination of the carboxy-terminal domain of VIR, resembling McClintock’s C1-I allele in maize. The abundance of alleles likely reflects cultural practices, by which fruits were venerated for magical and medicinal properties. The identification of VIR will allow selection of the trait at the seed or early-nursery stage, 3-6 years before fruits are produced, greatly advancing introgression into elite breeding material.
BackgroundThe commercial oil palm (Elaeis guineensis Jacq.) produces a mesocarp oil (commonly called ‘palm oil’) with approximately equal proportions of saturated and unsaturated fatty acids (FAs). An increase in unsaturated FAs content or iodine value (IV) as a measure of the degree of unsaturation would help to open up new markets for the oil. One way to manipulate the fatty acid composition (FAC) in palm oil is through introgression of favourable alleles from the American oil palm, E. oleifera, which has a more unsaturated oil.ResultsIn this study, a segregating E. oleifera x E. guineensis (OxG) hybrid population for FAC is used to identify quantitative trait loci (QTLs) linked to IV and various FAs. QTL analysis revealed 10 major and two putative QTLs for IV and six FAs, C14:0, C16:0, C16:1, C18:0, C18:1 and C18:2 distributed across six linkage groups (LGs), OT1, T2, T3, OT4, OT6 and T9. The major QTLs for IV and C16:0 on LGOT1 explained 60.0 – 69.0 % of the phenotypic trait variation and were validated in two independent BC2 populations. The genomic interval contains several key structural genes in the FA and oil biosynthesis pathways such as PATE/FATB, HIBCH, BASS2, LACS4 and DGAT1 and also a relevant transcription factor (TF), WRI1. The literature suggests that some of these genes can exhibit pleiotropic effects in the regulatory networks of these traits. Using the whole genome sequence data, markers tightly linked to the candidate genes were also developed. Clustering trait values according to the allelic forms of these candidate markers revealed significant differences in the IV and FAs of the palms in the mapping and validation crosses.ConclusionsThe candidate gene approach described and exploited here is useful to identify the potential causal genes linked to FAC and can be adopted for marker-assisted selection (MAS) in oil palm.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-2607-4) contains supplementary material, which is available to authorized users.
BackgroundOil palm is an important source of edible oil. The importance of the crop, as well as its long breeding cycle (10-12 years) has led to the sequencing of its genome in 2013 to pave the way for genomics-guided breeding. Nevertheless, the first set of gene predictions, although useful, had many fragmented genes. Classification and characterization of genes associated with traits of interest, such as those for fatty acid biosynthesis and disease resistance, were also limited. Lipid-, especially fatty acid (FA)-related genes are of particular interest for the oil palm as they specify oil yields and quality. This paper presents the characterization of the oil palm genome using different gene prediction methods and comparative genomics analysis, identification of FA biosynthesis and disease resistance genes, and the development of an annotation database and bioinformatics tools.ResultsUsing two independent gene-prediction pipelines, Fgenesh++ and Seqping, 26,059 oil palm genes with transcriptome and RefSeq support were identified from the oil palm genome. These coding regions of the genome have a characteristic broad distribution of GC3 (fraction of cytosine and guanine in the third position of a codon) with over half the GC3-rich genes (GC3 ≥ 0.75286) being intronless. In comparison, only one-seventh of the oil palm genes identified are intronless. Using comparative genomics analysis, characterization of conserved domains and active sites, and expression analysis, 42 key genes involved in FA biosynthesis in oil palm were identified. For three of them, namely EgFABF, EgFABH and EgFAD3, segmental duplication events were detected. Our analysis also identified 210 candidate resistance genes in six classes, grouped by their protein domain structures.ConclusionsWe present an accurate and comprehensive annotation of the oil palm genome, focusing on analysis of important categories of genes (GC3-rich and intronless), as well as those associated with important functions, such as FA biosynthesis and disease resistance. The study demonstrated the advantages of having an integrated approach to gene prediction and developed a computational framework for combining multiple genome annotations. These results, available in the oil palm annotation database (http://palmxplore.mpob.gov.my), will provide important resources for studies on the genomes of oil palm and related crops.ReviewersThis article was reviewed by Alexander Kel, Igor Rogozin, and Vladimir A. Kuznetsov.Electronic supplementary materialThe online version of this article (doi:10.1186/s13062-017-0191-4) contains supplementary material, which is available to authorized users.
The diacylglycerol acyltransferases (DGAT) (diacylglycerol:acyl-CoA acyltransferase, EC 2.3.1.20) are a key group of enzymes that catalyse the final and usually the most important rate-limiting step of triacylglycerol biosynthesis in plants and other organisms. Genes encoding four distinct functional families of DGAT enzymes have been characterised in the genome of the African oil palm, Elaeis guineensis. The contrasting features of the various isoforms within the four families of DGAT genes, namely DGAT1, DGAT2, DGAT3 and WS/DGAT are presented both in the oil palm itself and, for comparative purposes, in 12 other oil crop or model/related plants, namely Arabidopsis thaliana, Brachypodium distachyon, Brassica napus, Elaeis oleifera, Glycine max, Gossypium hirsutum, Helianthus annuus, Musa acuminata, Oryza sativa, Phoenix dactylifera, Sorghum bicolor, and Zea mays. The oil palm genome contains respectively three, two, two and two distinctly expressed functional copies of the DGAT1, DGAT2, DGAT3 and WS/DGAT genes. Phylogenetic analyses of the four DGAT families showed that the E. guineensis genes tend to cluster with sequences from P. dactylifera and M. acuminata rather than with other members of the Commelinid monocots group, such as the Poales which include the major cereal crops such as rice and maize. Comparison of the predicted DGAT protein sequences with other animal and plant DGATs was consistent with the E. guineensis DGAT1 being ER located with its active site facing the lumen while DGAT2, although also ER located, had a predicted cytosol-facing active site. In contrast, DGAT3 and some (but not all) WS/DGAT in E. guineensis are predicted to be soluble, cytosolic enzymes. Evaluation of E. guineensis DGAT gene expression in different tissues and developmental stages suggests that the four DGAT groups have distinctive physiological roles and are particularly prominent in developmental processes relating to reproduction, such as flowering, and in fruit/seed formation especially in the mesocarp and endosperm tissues.
Oil palm (Elaeis guineensis) is the most productive oil bearing crop worldwide. It has three fruit forms, namely dura (thick-shelled), pisifera (shell-less) and tenera (thin-shelled), which are controlled by the SHELL gene. The fruit forms exhibit monogenic co-dominant inheritance, where tenera is a hybrid obtained by crossing maternal dura and paternal pisifera palms. Commercial palm oil production is based on planting thin-shelled tenera palms, which typically yield 30% more oil than dura palms, while pisifera palms are female-sterile and have little to no palm oil yield. It is clear that tenera hybrids produce more oil than either parent due to single gene heterosis. The unintentional planting of dura or pisifera palms reduces overall yield and impacts land utilization that would otherwise be devoted to more productive tenera palms. Here, we identify three additional novel mutant alleles of the SHELL gene, which encode a type II MADS-box transcription factor, and determine oil yield via control of shell fruit form phenotype in a manner similar to two previously identified mutant SHELL alleles. Assays encompassing all five mutations account for all dura and pisifera palms analyzed. By assaying for these variants in 10,224 mature palms or seedlings, we report the first large scale accurate genotype-based determination of the fruit forms in independent oil palm planting sites and in the nurseries that supply them throughout Malaysia. The measured non-tenera contamination rate (10.9% overall on a weighted average basis) underscores the importance of SHELL genetic testing of seedlings prior to planting in production fields. By eliminating non-tenera contamination, comprehensive SHELL genetic testing can improve sustainability by increasing yield on existing planted lands. In addition, economic modeling demonstrates that SHELL gene testing will confer substantial annual economic gains to the oil palm industry, to Malaysian gross national income and to Malaysian government tax receipts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.