The clustered regularly interspaced short palindromic repeats (CRISPR)/Cas-mediated genome editing system has recently been used for haploid production in plants. Haploid induction using the CRISPR/Cas system represents an attractive approach in cannabis, an economically important industrial, recreational, and medicinal plant. However, the CRISPR system requires the design of precise (on-target) single-guide RNA (sgRNA). Therefore, it is essential to predict off-target activity of the designed sgRNAs to avoid unexpected outcomes. The current study is aimed to assess the predictive ability of three machine learning (ML) algorithms (radial basis function (RBF), support vector machine (SVM), and random forest (RF)) alongside the ensemble-bagging (E-B) strategy by synergizing MIT and cutting frequency determination (CFD) scores to predict sgRNA off-target activity through in silico targeting a histone H3-like centromeric protein, HTR12, in cannabis. The RF algorithm exhibited the highest precision, recall, and F-measure compared to all the tested individual algorithms with values of 0.61, 0.64, and 0.62, respectively. We then used the RF algorithm as a meta-classifier for the E-B method, which led to an increased precision with an F-measure of 0.62 and 0.66, respectively. The E-B algorithm had the highest area under the precision recall curves (AUC-PRC; 0.74) and area under the receiver operating characteristic (ROC) curves (AUC-ROC; 0.71), displaying the success of using E-B as one of the common ensemble strategies. This study constitutes a foundational resource of utilizing ML models to predict gRNA off-target activities in cannabis.
Cannabis (Cannabis sativa L.) is typically propagated using stem cuttings taken from mother plants to produce genetically uniform propagules. However, producers anecdotally report that clonal lines deteriorate over time and eventually produce clones with less vigor and lower cannabinoid levels than the original mother plant. While the cause of this deterioration has not been investigated, one potential contributor is the accumulation of somatic mutations within the plant. To test this, we used deep sequencing of whole genomes (>50×) to compare the variability within an individual cannabis cultivar Honey Banana plant sampled at the bottom, middle, and top. We called over six million sequence variants based on a reference genome and found that the top had the most by a sizable amount. Comparing the variants among the samples uncovered that nearly 600,000 (34%) were unique to the top while the bottom only contained 148,000 (12%), and middle with 77,000 (9%) unique variants. Bioinformatics tools were used to identify mutations in critical cannabinoid–terpene biosynthesis pathways. While none were identified as high impact, four genes contained more than double the average level of nucleotide diversity (π) in or near the gene. Two genes code for essential enzymes required for the cannabinoid pathway while the other two are in the terpene pathways, demonstrating that mutations were accumulating within these pathways and could influence their function. Overall, a measurable number of intraplant genetic diversity was discovered that could impact long‐term genetic fidelity of clonal lines and potentially contribute to the observed decline in vigor and cannabinoid content.
Drug-type cannabis is often multiplied using micropropagation methods to produce genetically uniform and disease/insect-free crops. However, micropropagated plantlets often exhibit phenotypic variation, leading to culture decline over time. In cannabis, the source of these changes remains unknown, though several factors (e.g., explant’s sources and prolonged in vitro culture) can result in such phenotypical variations. The study presented herein evaluates the effects of explant sources (i.e., nodal segments derived from the basal, near-basal, middle, and apical parts of the greenhouse-grown mother plant) over multiple subcultures (4 subcultures during 235 days) on multiplication parameters and leaf morphological traits of in vitro cannabis plantlets. While initial in vitro responses were similar among explants sourced from different regions of the plant, there were significant differences in performance over the course of multiple subcultures. Specifically, explant source and/or the number of subcultures significantly impacted plantlet height, number of nodes, and canopy surface area. The explants derived from the basal and near-basal parts of the plant resulted in the tallest shoots with the greatest number of nodes, while the explants derived from the middle and apical regions led to shorter shoots with fewer nodes. Moreover, the basal-derived explants produced cannabis plantlets with shorter but wider leaves which demonstrated the potential of such explants for in vitro rejuvenation practices with minimal culture decline. This study provides new evidence into the long-term impacts of explant source in cannabis micropropagation.
Cannabis is typically propagated using stem cuttings taken from mother plants to produce genetically uniform propagules. However, producers anecdotally report that clonal lines deteriorate over time and eventually produce clones with less vigour and lower cannabinoid levels than the original mother plant. While the cause of this deterioration has not been investigated, one potential contributor is the accumulation of somatic mutations within the plant. To test this, we used deep sequencing of whole genomes (>50x depth of coverage) to compare the variability within an individual Cannabis sativa cv. Honey Banana plant sampled at the bottom, middle and top. Overall, we called over 6 million sequence variants based on a published reference genome (SNPs, MNPs, and indels) and found that that the top had the most by a sizable amount. We compared the variants among the samples and uncovered that nearly 600K (34%) were unique to the top while the bottom only contained 148K (12%) and middle with 77K (9%) unique variants. Bioinformatics tools were used to identify high impact mutations in critical cannabinoid/terpene biosynthesis pathways. While none were identified, some contained more than double the average level of nucleotide diversity (π) in or near the gene, including OLS, CBDAS, HMGR2 and CsTPS9FN. The first two genes code for essential enzymes required for the cannabinoid pathway while the other two are involved in the terpene pathways, demonstrating that mutations were accumulating within these pathways and could influence their function. Overall, these data identified a measurable number of intra-plant genetic diversity that could impact the long-term genetic fidelity of clonal lines and potentially contribute to the observed decline in vigour and cannabinoid content.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.