Significance
A high-quality genome assembly of Camellia sinensis var. sinensis facilitates genomic, transcriptomic, and metabolomic analyses of the quality traits that make tea one of the world’s most-consumed beverages. The specific gene family members critical for biosynthesis of key tea metabolites, monomeric galloylated catechins and theanine, are indicated and found to have evolved specifically for these functions in the tea plant lineage. Two whole-genome duplications, critical to gene family evolution for these two metabolites, are identified and dated, but are shown to account for less amplification than subsequent paralogous duplications. These studies lay the foundation for future research to understand and utilize the genes that determine tea quality and its diversity within tea germplasm.
Bamboo represents the only major lineage of grasses that is native to forests and is one of the most important nontimber forest products in the world. However, no species in the Bambusoideae subfamily has been sequenced. Here, we report a high-quality draft genome sequence of moso bamboo (P. heterocycla var. pubescens). The 2.05-Gb assembly covers 95% of the genomic region. Gene prediction modeling identified 31,987 genes, most of which are supported by cDNA and deep RNA sequencing data. Analyses of clustered gene families and gene collinearity show that bamboo underwent whole-genome duplication 7-12 million years ago. Identification of gene families that are key in cell wall biosynthesis suggests that the whole-genome duplication event generated more gene duplicates involved in bamboo shoot development. RNA sequencing analysis of bamboo flowering tissues suggests a potential connection between drought-responsive and flowering genes.
Bamboo is one of the most important non-timber forest products in the world. About 2.5 billion people depend economically on bamboo, and international trade in bamboo amounts to over 2.5 billion US dollars per year [1]. Bamboo has a rather striking life history, characterized by a prolonged vegetative phase lasting decades before flowering, thereby inhibiting genetic improvement. Recent genomic studies in bamboo have included genome-wide full-length cDNA sequencing [2], chloroplast genome sequencing [3], identification of syntenic genes between bamboo and other grasses [4] and phylogenetic analysis of Bambusoideae subspecies [5]. Fifty-nine simple sequence repeat markers from rice and sugarcane were used in the genetic diversity analyses of 23 bamboo species [6], and 2 species-specific sequence-characterized amplified region markers were developed in the identification of different bamboo species [7].
Here, we report the draft genome of moso bamboo, a large woody bamboo that has ecological, economic and cultural value in Asia and accounts for ~70% of the total bamboo growth area. Comparative genome-wide analyses of bamboo to other grass species, including rice, maize and sorghum, yielded new genetic insights into the rapid and marked phenotypic and ecological divergence of bamboo and closely related grasses.
The moso bamboo genome contains 24 pairs of chromosomes [8] (2n = 48) and is characteristic of a diploid (Supplementary Fig. 1a). We conducted a flow cytometry analysis and estimated that it had a genome size of 2.075 Gb (2C = 4.24 pg; Supplementary Fig. 1b), which was very close to that estimated in a previous report [9].
Because it is difficult to generate an inbred line of moso bamboo, owing to its infrequent sexual reproduction and the long periods of time between flowering intervals, we selected five plants from a single individual rhizome of the moso bamboo ecotype (P. heterocycla var. pubescens) and performed whole-genome shotgun sequencing. We generated 295 Gb of raw sequence data (approximately 147-fold coverage), including Illumina short reads and 10,327 pairs of BAC end ...
Rice is the principal food for over half of the population of the world. With its genome size of 430 megabase pairs (Mb), the cultivated rice species Oryza sativa is a model plant for genome research. Here we report the sequence analysis of chromosome 4 of O. sativa, one of the first two rice chromosomes to be sequenced completely. The finished sequence spans 34.6 Mb and represents 97.3% of the chromosome. In addition, we report the longest known sequence for a plant centromere, a completely sequenced contig of 1.16 Mb corresponding to the centromeric region of chromosome 4. We predict 4,658 protein coding genes and 70 transfer RNA genes. A total of 1,681 predicted genes match available unique rice expressed sequence tags. Transposable elements have a pronounced bias towards the euchromatic regions, indicating a close correlation of their distributions to genes along the chromosome. Comparative genome analysis between cultivated rice subspecies shows that there is an overall syntenic relationship between the chromosomes and divergence at the level of single-nucleotide polymorphisms and insertions and deletions. By contrast, there is little conservation in gene order between rice and Arabidopsis.
Morphinan-based painkillers are derived from opium poppy (Papaver somniferum L.). We report a draft of the opium poppy genome, with 2.72 gigabases assembled into 11 chromosomes with contig N50 and scaffold N50 of 1.77 and 204 megabases, respectively. Synteny analysis suggests a whole-genome duplication at ~7.8 million years ago and ancient segmental or whole-genome duplication(s) that occurred before the Papaveraceae-Ranunculaceae divergence 110 million years ago. Syntenic blocks representative of phthalideisoquinoline and morphinan components of a benzylisoquinoline alkaloid cluster of 15 genes provide insight into how this cluster evolved. Paralog analysis identified P450 and oxidoreductase genes that combined to form the gene fusion essential for morphinan biosynthesis in opium poppy. Thus, gene duplication, rearrangement, and fusion events have led to evolution of specialized metabolic products in opium poppy.
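For readers unfamiliar with the N50 statistic quoted in the abstract above, a minimal Python sketch follows. It assumes the common definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The function name n50 and the toy lengths are illustrative only and are not taken from the opium poppy study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Assumed definition: sort lengths from longest to shortest and
    accumulate them; the N50 is the length at which the running sum
    first reaches at least half of the total length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Toy example with made-up lengths (arbitrary units).
toy_lengths = [10, 9, 8, 7, 6, 5, 4, 3, 2]
toy_n50 = n50(toy_lengths)
```

With the toy input, the total is 54, and the three longest sequences (10 + 9 + 8 = 27) are the first to reach half of it, so the sketch returns 8. Applied to real data, the same function would be run once on contig lengths and once on scaffold lengths to give the two values reported.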
Approximately 75% of vertebrate proteins belong to protein families encoded by multiple evolutionarily related genes, a pattern that emerged as a result of gene and genome duplications over the course of vertebrate evolution. In families of genes with similar or related functions, adaptation to a strong selective agent should involve multiple adaptive changes across the entire gene family. However, we know of no evolutionary studies that have explicitly addressed this point. Here, we show how 4 taxonomically diverse species of pufferfishes (Tetraodontidae) each evolved resistance to the guanidinium toxins tetrodotoxin (TTX) and saxitoxin (STX) via parallel amino acid replacements across all 8 sodium channels present in teleost fish genomes. This resulted in diverse suites of coexisting sodium channel types that all confer varying degrees of toxin resistance, yet show remarkable convergence among genes and phylogenetically diverse species. Using site-directed mutagenesis and expression of a vertebrate sodium channel, we also demonstrate that resistance to TTX/STX is enhanced up to 15-fold by single, frequently observed replacements at 2 sites that have not previously been implicated in toxin binding but show similar or identical replacements in pufferfishes and in distantly related vertebrate and nonvertebrate animals. This study presents an example of natural selection acting upon a complete gene family, repeatedly arriving at a diverse but limited number of adaptive changes within the same genome. To be maximally informative, we suggest that future studies of molecular adaptation should consider all functionally similar paralogs of the affected gene family.
Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms.