SignificanceA high-quality genome assembly of Camellia sinensis var. sinensis facilitates genomic, transcriptomic, and metabolomic analyses of the quality traits that make tea one of the world’s most-consumed beverages. The specific gene family members critical for biosynthesis of key tea metabolites, monomeric galloylated catechins and theanine, are indicated and found to have evolved specifically for these functions in the tea plant lineage. Two whole-genome duplications, critical to gene family evolution for these two metabolites, are identified and dated, but are shown to account for less amplification than subsequent paralogous duplications. These studies lay the foundation for future research to understand and utilize the genes that determine tea quality and its diversity within tea germplasm.
BackgroundTea is one of the most popular non-alcoholic beverages worldwide. However, the tea plant, Camellia sinensis, is difficult to culture in vitro, to transform, and has a large genome, rendering little genomic information available. Recent advances in large-scale RNA sequencing (RNA-seq) provide a fast, cost-effective, and reliable approach to generate large expression datasets for functional genomic analysis, which is especially suitable for non-model species with un-sequenced genomes.ResultsUsing high-throughput Illumina RNA-seq, the transcriptome from poly (A)+ RNA of C. sinensis was analyzed at an unprecedented depth (2.59 gigabase pairs). Approximate 34.5 million reads were obtained, trimmed, and assembled into 127,094 unigenes, with an average length of 355 bp and an N50 of 506 bp, which consisted of 788 contig clusters and 126,306 singletons. This number of unigenes was 10-fold higher than existing C. sinensis sequences deposited in GenBank (as of August 2010). Sequence similarity analyses against six public databases (Uniprot, NR and COGs at NCBI, Pfam, InterPro and KEGG) found 55,088 unigenes that could be annotated with gene descriptions, conserved protein domains, or gene ontology terms. Some of the unigenes were assigned to putative metabolic pathways. Targeted searches using these annotations identified the majority of genes associated with several primary metabolic pathways and natural product pathways that are important to tea quality, such as flavonoid, theanine and caffeine biosynthesis pathways. Novel candidate genes of these secondary pathways were discovered. Comparisons with four previously prepared cDNA libraries revealed that this transcriptome dataset has both a high degree of consistency with previous EST data and an approximate 20 times increase in coverage. Thirteen unigenes related to theanine and flavonoid synthesis were validated. Their expression patterns in different organs of the tea plant were analyzed by RT-PCR and quantitative real time PCR (qRT-PCR).ConclusionsAn extensive transcriptome dataset has been obtained from the deep sequencing of tea plant. The coverage of the transcriptome is comprehensive enough to discover all known genes of several major metabolic pathways. This transcriptome dataset can serve as an important public information platform for gene expression, genomics, and functional genomic studies in C. sinensis.
Tea plant is an important economic crop, which is used to produce the world's oldest and most widely consumed tea beverages. Here, we present a high-quality reference genome assembly of the tea plant (Camellia sinensis var. sinensis) consisting of 15 pseudo-chromosomes. LTR retrotransposons (LTR-RTs) account for 70.38% of the genome, and we present evidence that LTR-RTs play critical roles in genome size expansion and the transcriptional diversification of tea plant genes through preferential insertion in promoter regions and introns. Genes, particularly those coding for terpene biosynthesis proteins, associated with tea aroma and stress resistance were significantly amplified through recent tandem duplications and exist as gene clusters in tea plant genome. Phylogenetic analysis of the sequences of 81 tea plant accessions with diverse origins revealed three well-differentiated tea plant populations, supporting the proposition for the southwest origin of the Chinese cultivated tea plant and its later spread to western Asia through introduction. Domestication and modern breeding left significant signatures on hundreds of genes in the tea plant genome, particularly those associated with tea quality and stress resistance. The genomic sequences of the reported reference and resequenced tea plant accessions provide valuable resources for future functional genomics study and molecular breeding of improved cultivars of tea plants.
Summary Tea is the world's widely consumed nonalcohol beverage with essential economic and health benefits. Confronted with the increasing large‐scale omics‐data set particularly the genome sequence released in tea plant, the construction of a comprehensive knowledgebase is urgently needed to facilitate the utilization of these data sets towards molecular breeding. We hereby present the first integrative and specially designed web‐accessible database, Tea Plant Information Archive (TPIA; http://tpia.teaplant.org). The current release of TPIA employs the comprehensively annotated tea plant genome as framework and incorporates with abundant well‐organized transcriptomes, gene expressions (across species, tissues and stresses), orthologs and characteristic metabolites determining tea quality. It also hosts massive transcription factors, polymorphic simple sequence repeats, single nucleotide polymorphisms, correlations, manually curated functional genes and globally collected germplasm information. A variety of versatile analytic tools (e.g. JBrowse, blast, enrichment analysis, etc.) are established helping users to perform further comparative, evolutionary and functional analysis. We show a case application of TPIA that provides novel and interesting insights into the phytochemical content variation of section Thea of genus Camellia under a well‐resolved phylogenetic framework. The constructed knowledgebase of tea plant will serve as a central gateway for global tea community to better understand the tea plant biology that largely benefits the whole tea industry.
Many researchers have reported that obesity is a major risk factor for diabetes, cardiovascular diseases, several forms of cancer (such as breast, colon and prostate), pulmonary, osteoarticular and metabolic diseases in the past decades. Recently, the hypolipidemic and anti-obesity effects of green tea in animals and humans have slowly become a hot topic in nutritional and food science research. This review will up-date the information of the anti-obesity effects of green tea in human intervention and animal studies. During recent years, an increasing number of clinical trials have confirmed the beneficial effects of green tea on obesity. However, the optimal dose has not yet been established owing to the very different results from studies with a similar design, which may be caused by differences in the extent of obesity, dietary intake, physical activity intensity, the strength of subjects' compliance to test instruction, the genetic background of populations, body composition and dietary habits. Therefore, further investigations on a larger scale and with longer periods of observation and tighter controls are needed to define optimal doses in subjects with varying degrees of metabolic risk factors and to determine differences in beneficial effects among diverse populations. Moreover, data from laboratory studies have shown that green tea has important roles in fat metabolism by reducing food intake, interrupting lipid emulsification and absorption, suppressing adipogenesis and lipid synthesis and increasing energy expenditure via thermogenesis, fat oxidation and fecal lipid excretion. However, the exact molecular mechanisms remain elusive.
Strawberry (Fragaria × ananassa Duch), a fruit of economic and nutritional importance, is also a model species for fleshy fruits and genomics in Rosaceae. Strawberry fruit quality at different harvest stages is a function of the fruit's metabolite content, which results from physiological changes during fruit growth and ripening. In order to investigate strawberry fruit development, untargeted (GC-MS) and targeted (HPLC) metabolic profiling analyses were conducted. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were employed to explore the non-polar and polar metabolite profiles from fruit samples at seven developmental stages. Different cluster patterns and a broad range of metabolites that exerted influence on cluster formation of metabolite profiles were observed. Significant changes in metabolite levels were found in both fruits turning red and fruits over-ripening in comparison with red-ripening fruits. The levels of free amino acids decreased gradually before the red-ripening stage, but increased significantly in the over-ripening stage. Metabolite correlation and network analysis revealed the interdependencies of individual metabolites and metabolic pathways. Activities of several metabolic pathways, including ester biosynthesis, the tricarboxylic acid cycle, the shikimate pathway, and amino acid metabolism, shifted during fruit growth and ripening. These results not only confirmed published metabolic data but also revealed new insights into strawberry fruit composition and metabolite changes, thus demonstrating the value of metabolomics as a functional genomics tool in characterizing the mechanism of fruit quality formation, a key developmental stage in most economically important fruit crops.
R2R3-MYB, bHLH, and WD40 proteins have been shown to control multiple enzymatic steps in the biosynthetic pathway responsible for the production of flavonoids, important secondary metabolites in Camellia sinensis. Few related transcription factor genes have been documented. The presence of R2R3-MYB, bHLH, and WD40 were statistically and bioinformatically analyzed on 127,094 C. sinensis transcriptome unigenes, resulting in identification of 73, 49, and 134 genes, respectively. C. sinensis phylogenetic trees were constructed for R2R3-MYB and bHLH proteins using previous Arabidopsis data and further divided into 27 subgroups (Sg) and 32 subfamilies. Motifs in some R2R3-MYB subgroups were redefined. Furthermore, Sg26 and Sg27 were expanded compared to Arabidopsis data, and bHLH proteins in C. sinensis were grouped into nine subfamilies. According to the functional annotation of Arabidopsis, flavonoid biosynthesis in C. sinensis was predicted to include R2R3-MYB genes in Sg4 (6), Sg5 (2), and Sg7 (1), as well as bHLH genes in subfamily 2 (2) and subfamily 24 (5). The wide evolutionary gap prevented phylogenetic analysis of WD40s; however, a single gene, CsWD40-1, was observed to share 80.4 % sequence homogeny with AtTTG1. Analysis of CsMYB4-1, CsMYB4-2, CsMYB4-3, CsMYB4-4, CsMYB5-1, and CsMYB5-2 revealed the interaction motif [DE]Lx2[RK]x3Lx6Lx3R, potentially contributing to the specificity of the bHLH partner in the stable MYB-bHLH complex. Full-length end-to-end polymerase chain reaction (PCR) and quantitative reverse transcriptase (qRT)-PCR were used to validate selected genes and generate relative expression ratio profiles in C. sinensis leaves by developmental stage and treatment conditions, including hormone and wound treatments. Potential target binding sites were predicted.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.