BackgroundCannabis sativa has been cultivated throughout human history as a source of fiber, oil and food, and for its medicinal and intoxicating properties. Selective breeding has produced cannabis plants for specific uses, including high-potency marijuana strains and hemp cultivars for fiber and seed production. The molecular biology underlying cannabinoid biosynthesis and other traits of interest is largely unexplored.ResultsWe sequenced genomic DNA and RNA from the marijuana strain Purple Kush using shortread approaches. We report a draft haploid genome sequence of 534 Mb and a transcriptome of 30,000 genes. Comparison of the transcriptome of Purple Kush with that of the hemp cultivar 'Finola' revealed that many genes encoding proteins involved in cannabinoid and precursor pathways are more highly expressed in Purple Kush than in 'Finola'. The exclusive occurrence of Δ9-tetrahydrocannabinolic acid synthase in the Purple Kush transcriptome, and its replacement by cannabidiolic acid synthase in 'Finola', may explain why the psychoactive cannabinoid Δ9-tetrahydrocannabinol (THC) is produced in marijuana but not in hemp. Resequencing the hemp cultivars 'Finola' and 'USO-31' showed little difference in gene copy numbers of cannabinoid pathway enzymes. However, single nucleotide variant analysis uncovered a relatively high level of variation among four cannabis types, and supported a separation of marijuana and hemp.ConclusionsThe availability of the Cannabis sativa genome enables the study of a multifunctional plant that occupies a unique role in human culture. Its availability will aid the development of therapeutic marijuana strains with tailored cannabinoid profiles and provide a basis for the breeding of hemp with improved agronomic characteristics.
Cannabis sativa is widely cultivated for medicinal, food, industrial, and recreational use, but much remains unknown regarding its genetics, including the molecular determinants of cannabinoid content. Here, we describe a combined physical and genetic map derived from a cross between the drug-type strain Purple Kush and the hemp variety "Finola." The map reveals that cannabinoid biosynthesis genes are generally unlinked but that aromatic prenyltransferase (AP), which produces the substrate for THCA and CBDA synthases (THCAS and CBDAS), is tightly linked to a known marker for total cannabinoid content. We further identify the gene encoding CBCA synthase (CBCAS) and characterize its catalytic activity, providing insight into how cannabinoid diversity arises in cannabis. THCAS and CBDAS (which determine the drug vs. hemp chemotype) are contained within large (>250 kb) retrotransposon-rich regions that are highly nonhomologous between drug-and hemp-type alleles and are furthermore embedded within ∼40 Mb of minimally recombining repetitive DNA. The chromosome structures are similar to those in grains such as wheat, with recombination focused in gene-rich, repeat-depleted regions near chromosome ends. The physical and genetic map should facilitate further dissection of genetic and molecular mechanisms in this commercially and medically important plant.
SUMMARYThe initial reactions of the phenylpropanoid pathway convert phenylalanine to p-coumaroyl CoA, a branch point metabolite from which many phenylpropanoids are made. Although the second enzyme of this pathway, cinnamic acid 4-hydroxylase (C4H), is well characterized, a mutant for the gene encoding this enzyme has not yet, to our knowledge, been identified, presumably because knock-out mutations in this gene would have severe phenotypes. This work describes the characterization of an allelic series of Arabidopsis reduced epidermal fluorescence 3 (ref3) mutants, each of which harbor mis-sense mutations in C4H (At2g30490). Heterologous expression of the mutant proteins in Escherichia coli yields enzymes that exhibit P420 spectra, indicative of mis-folded proteins, or have limited ability to bind substrate, indicating that the mutations we have identified affect protein stability and/or enzyme function. In agreement with the early position of C4H in phenylpropanoid metabolism, ref3 mutant plants accumulate decreased levels of several different classes of phenylpropanoid end-products, and exhibit reduced lignin deposition and altered lignin monomer content. Furthermore, these plants accumulate a novel hydroxycinnamic ester, cinnamoylmalate, which is not found in the wild type. The decreased C4H activity in ref3 also causes pleiotropic phenotypes, including dwarfism, male sterility and the development of swellings at branch junctions. Together, these observations indicate that C4H function is critical to the normal biochemistry and development of Arabidopsis.
Despite its cultivation as a source of food, fibre and medicine, and its global status as the most used illicit drug, the genus Cannabis has an inconclusive taxonomic organization and evolutionary history. Drug types of Cannabis (marijuana), which contain high amounts of the psychoactive cannabinoid Δ 9-tetrahydrocannabinol (THC), are used for medical purposes and as a recreational drug. Hemp types are grown for the production of seed and fibre, and contain low amounts of THC. Two species or gene pools (C. sativa and C. indica) are widely used in describing the pedigree or appearance of cultivated Cannabis plants. Using 14,031 single-nucleotide polymorphisms (SNPs) genotyped in 81 marijuana and 43 hemp samples, we show that marijuana and hemp are significantly differentiated at a genome-wide level, demonstrating that the distinction between these populations is not limited to genes underlying THC production. We find a moderate correlation between the genetic structure of marijuana strains and their reported C. sativa and C. indica ancestry and show that marijuana strain names often do not reflect a meaningful genetic identity. We also provide evidence that hemp is genetically more similar to C. indica type marijuana than to C. sativa strains.
Δ 9 -Tetrahydrocannabinol (THC) and other cannabinoids are responsible for the psychoactive and medicinal properties of Cannabis sativa L. (marijuana). The first intermediate in the cannabinoid biosynthetic pathway is proposed to be olivetolic acid (OA), an alkylresorcinolic acid that forms the polyketide nucleus of the cannabinoids. OA has been postulated to be synthesized by a type III polyketide synthase (PKS) enzyme, but so far type III PKSs from cannabis have been shown to produce catalytic byproducts instead of OA. We analyzed the transcriptome of glandular trichomes from female cannabis flowers, which are the primary site of cannabinoid biosynthesis, and searched for polyketide cyclase-like enzymes that could assist in OA cyclization. Here, we show that a type III PKS (tetraketide synthase) from cannabis trichomes requires the presence of a polyketide cyclase enzyme, olivetolic acid cyclase (OAC), which catalyzes a C2-C7 intramolecular aldol condensation with carboxylate retention to form OA. OAC is a dimeric α+β barrel (DABB) protein that is structurally similar to polyketide cyclases from Streptomyces species. OAC transcript is present at high levels in glandular trichomes, an expression profile that parallels other cannabinoid pathway enzymes. Our identification of OAC both clarifies the cannabinoid pathway and demonstrates unexpected evolutionary parallels between polyketide biosynthesis in plants and bacteria. In addition, the widespread occurrence of DABB proteins in plants suggests that polyketide cyclases may play an overlooked role in generating plant chemical diversity.natural products | phytocannabinoid | terpenophenolic | aldolase | ferredoxin-like
Lycophytes arose in the early Silurian (Ϸ400 Mya) and represent a major lineage of vascular plants that has evolved in parallel with the ferns, gymnosperms, and angiosperms. A hallmark of vascular plants is the presence of the phenolic lignin heteropolymer in xylem and other sclerified cell types. Although syringyl lignin is often considered to be restricted in angiosperms, it has been detected in lycophytes as well. Here we report the characterization of a cytochrome P450-dependent monooxygenase from the lycophyte Selaginella moellendorffii. Gene expression data, crossspecies complementation experiments, and in vitro enzyme assays indicate that this P450 is a ferulic acid/coniferaldehyde/coniferyl alcohol 5-hydroxylase (F5H), and is capable of diverting guaiacylsubstituted intermediates into syringyl lignin biosynthesis. Phylogenetic analysis indicates that the Selaginella F5H represents a new family of plant P450s and suggests that it has evolved independently of angiosperm F5Hs.L ignin is an aromatic heteropolymer that is deposited most abundantly in the secondary cell walls of vascular plants. It provides structural rigidity to the plant body while enabling individual tracheary elements to withstand the tension generated during water transport; it also serves a defensive role against herbivores and pathogens (1). Lignins are derived mainly from the phenylpropanoid monomers p-coumaryl, coniferyl, and sinapyl alcohol, which give rise to p-hydroxyphenyl, guaiacyl, and syringyl subunits when incorporated into the lignin polymer (2). In angiosperms, three cytochrome P450-dependent monooxygenases (P450s) are involved in the biosynthesis of lignin monomers, cinnamate 4-hydroxylase (C4H), p-coumaroyl shikimate/quinate 3Ј-hydroxylase (C3ЈH), and ferulic acid/coniferaldehyde/coniferyl alcohol 5-hydroxylase (F5H) ( Fig. 1) (3). C4H and C3ЈH are responsible for phenylpropanoid 4 and 3-hydroxylation (4, 5), respectively, whereas F5H catalyzes the 5-hydroxylation of coniferaldehyde and coniferyl alcohol, leading to the formation of syringyl lignin (6, 7). Lignin monomer composition has been found to vary among major phyla of vascular plants (2). Generally, ferns and gymnosperms deposit lignins that are derived primarily from guaiacyl monomers together with a small proportion of phydroxyphenyl units, whereas angiosperm lignins are guaiacyl/ syringyl copolymers that also can contain some p-hydroxyphenyl monomers. This distribution suggests that F5H may be a relatively recent addition to plants' biochemical repertoire. Nevertheless, there are older reports in the literature in which syringyl monomers have been detected in lignins from lycophytes, including species of Selaginella (8-12), by using histochemical reagents and by today's standards relatively crude chemical methods. These results have been verified recently by using more modern techniques (13). How species that diverged from angiosperms Ͼ400 Mya (14) acquired the ability to synthesize syringyl lignin is unknown. Results Lignin Composition Analysis in Representativ...
Plants produce a vast array of specialized metabolites, many of which are used as pharmaceuticals, flavors, fragrances, and other high-value fine chemicals. However, most of these compounds occur in non-model plants for which genomic sequence information is not yet available. The production of a large amount of nucleotide sequence data using next-generation technologies is now relatively fast and cost-effective, especially when using the latest Roche-454 and Illumina sequencers with enhanced base-calling accuracy. To investigate specialized metabolite biosynthesis in non-model plants we have established a data-mining framework, employing next-generation sequencing and computational algorithms, to construct and analyze the transcriptomes of 75 non-model plants that produce compounds of interest for biotechnological applications. After sequence assembly an extensive annotation approach was applied to assign functional information to over 800,000 putative transcripts. The annotation is based on direct searches against public databases, including RefSeq and InterPro. Gene Ontology (GO), Enzyme Commission (EC) annotations and associated Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway maps are also collected. As a proof-of-concept, the selection of biosynthetic gene candidates associated with six specialized metabolic pathways is described. A web-based BLAST server has been established to allow public access to assembled transcriptome databases for all 75 plant species of the PhytoMetaSyn Project (www.phytometasyn.ca).
SUMMARYThe psychoactive and analgesic cannabinoids (e.g. D 9 -tetrahydrocannabinol (THC)) in Cannabis sativa are formed from the short-chain fatty acyl-coenzyme A (CoA) precursor hexanoyl-CoA. Cannabinoids are synthesized in glandular trichomes present mainly on female flowers. We quantified hexanoyl-CoA using LC-MS/MS and found levels of 15.5 pmol g )1 fresh weight in female hemp flowers with lower amounts in leaves, stems and roots. This pattern parallels the accumulation of the end-product cannabinoid, cannabidiolic acid (CBDA). To search for the acyl-activating enzyme (AAE) that synthesizes hexanoyl-CoA from hexanoate, we analyzed the transcriptome of isolated glandular trichomes. We identified 11 unigenes that encoded putative AAEs including CsAAE1, which shows high transcript abundance in glandular trichomes. In vitro assays showed that recombinant CsAAE1 activates hexanoate and other short-and medium-chained fatty acids. This activity and the trichome-specific expression of CsAAE1 suggest that it is the hexanoyl-CoA synthetase that supplies the cannabinoid pathway. CsAAE3 encodes a peroxisomal enzyme that activates a variety of fatty acid substrates including hexanoate. Although phylogenetic analysis showed that CsAAE1 groups with peroxisomal AAEs, it lacked a peroxisome targeting sequence 1 (PTS1) and localized to the cytoplasm. We suggest that CsAAE1 may have been recruited to the cannabinoid pathway through the loss of its PTS1, thereby redirecting it to the cytoplasm. To probe the origin of hexanoate, we analyzed the trichome expressed sequence tag (EST) dataset for enzymes of fatty acid metabolism. The high abundance of transcripts that encode desaturases and a lipoxygenase suggests that hexanoate may be formed through a pathway that involves the oxygenation and breakdown of unsaturated fatty acids.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.