The rubber tree, Hevea brasiliensis, produces natural rubber that serves as an essential industrial raw material. Here, we present a high-quality reference genome for a rubber tree cultivar GT1 using single-molecule real-time sequencing (SMRT) and Hi-C technologies to anchor the $1.47-Gb genome assembly into 18 pseudochromosomes. The chromosome-based genome analysis enabled us to establish a model of spurge chromosome evolution, since the common paleopolyploid event occurred before the split of Hevea and Manihot. We show recent and rapid bursts of the three Hevea-specific LTR-retrotransposon families during the last 10 million years, leading to the massive expansion by $65.88% ($970 Mbp) of the whole rubber tree genome since the divergence from Manihot. We identify large-scale expansion of genes associated with whole rubber biosynthesis processes, such as basal metabolic processes, ethylene biosynthesis, and the activation of polysaccharide and glycoprotein lectin, which are important properties for latex production. A map of genomic variation between the cultivated and wild rubber trees was obtained, which contains $15.7 million high-quality single-nucleotide polymorphisms. We identified hundreds of candidate domestication genes with drastically lowered genomic diversity in the cultivated but not wild rubber trees despite a relatively short domestication history of rubber tree, some of which are involved in rubber biosynthesis. This genome assembly represents key resources for future rubber tree research and breeding, providing novel targets for improving plant biotic and abiotic tolerance and rubber production.
Macadamia tetraphylla (Proteaceae) is one of the two macadamia plants that are edible and of cultivated value. Only two chloroplast genomes were reported in Proteaceae so far. In this study, we report the complete chloroplast genome sequence of M. tetraphylla, which is the third reported chloroplast genome in Proteaceae. The chloroplast genome is 159,195 bp long and includes 113 genes. Its LSC, SSC and IR regions are 87,951, 18,748 and 26,248 bp long, respectively. Phylogenetic analysis indicates that M. tetraphylla was clustered with other two species of Proteaceae, the M. integrifoia and M. ternifolia.
Macadamia ternifolia is a subtropical fruit tree of the family Proteaceae. Chloroplast genome sequences play a significant role in the development of molecular markers in plant phylogenetic and population genetic studies. In this study, we report the complete chloroplast genome sequence of M. ternifolia. The chloroplast genome is 159,669 bp long and includes 113 genes. Its LSC, SSC, and IR regions are 88,072, 18,801, and 26,408 bp long, respectively. Sequence comparison of M. ternifolia and M. integrifolia indicates large sequence conservation between these two species, only few sequence variations including single nucleotide polymorphisms (SNPs) and inserts/deletes (InDels) were detected. ARTICLE HISTORY
Macadamia is a kind of evergreen nut trees which belong to the Proteaceae family. The two commercial macadamia species, Macadamia integrifolia and M. tetraphylla, are highly prized for their edible kernels. Catherine et al. reported M. integrifolia genome using NGS sequencing technology. However, the lack of a high-quality assembly for M. tetraphylla hinders the progress in biological research and breeding program. In this study, we report a high-quality genome sequence of M. tetraphylla using the Oxford Nanopore Technologies (ONT) technology. We generated an assembly of 750.54 Mb with a contig N50 length of 1.18 Mb, which is close to the size estimated by flow cytometry and k-mer analysis. Repetitive sequence represent 58.57% of the genome sequence, which is strikingly higher compared with M. integrifolia. A total of 31,571 protein-coding genes were annotated with an average length of 6,055 bp, of which 92.59% were functionally annotated. The genome sequence of M. tetraphylla will provide novel insights into the breeding of novel strains and genetic improvement of agronomic traits.
Macadamia is an evergreen tree belonging to the Proteaceae family. The two commercial macadamia species, Macadamia integrifolia and M. tetraphylla, are highly prized for their edible kernels. The M. integrifolia genome was recently sequenced, but the genome of M. tetraphylla has to date not been published, which limits the study of biological research and breeding in this species. This study reports a high-quality genome sequence of M. tetraphylla based on the Oxford Nanopore Technologies technology and high-throughput chromosome conformation capture techniques (Hi-C). An assembly of 750.87 Mb with 51.11 Mb N50 length was generated, close to the 740 and 758 Mb size estimates by flow cytometry and k-mer analysis, respectively. Genome annotation indicated that 61.42% of the genome is composed of repetitive sequences and 34.95% is composed of long terminal repeat retrotransposons. Up to 31,571 protein-coding genes were predicted, of which 92.59% were functionally annotated. The average gene length was 6,055 bp. Comparative genome analysis revealed that the gene families associated with defense response, lipid transport, steroid biosynthesis, triglyceride lipase activity, and fatty acid metabolism are expanded in the M. tetraphylla genome. The distribution of fourfold synonymous third-codon transversion showed a recent whole-genome duplication event in M. tetraphylla. Genomic and transcriptomic analysis identified 187 genes encoding 33 crucial oil biosynthesis enzymes, depicting a comprehensive map of macadamia lipid biosynthesis. Besides, the 55 identified WRKY genes exhibited preferential expression in root as compared to that in other tissues. The genome sequence of M. tetraphylla provides novel insights for breeding novel varieties and genetic improvement of agronomic traits.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.