The growing need for cassava, a food and fuel crop, has led to a new plant breeding technique designed to accelerate breeding of cassava with modified starch.
CRISPR systems have been co-opted by Tn7-like elements to direct RNA-guided transposition. Type V-K CRISPR-associated transposons rely on the concerted activities of the pseudonuclease Cas12k, the AAA+ ATPase TnsC, the Zn-finger protein TniQ, and the transposase TnsB. Here we present a cryo-electron microscopic structure of a target DNA-bound Cas12k-transposon recruitment complex comprising RNA-guided Cas12k, TniQ, TnsC and, unexpectedly, the ribosomal protein S15. Complex assembly on target DNA results in complete R-loop formation mediated by critical interactions between TniQ and the trans-activating crRNA, and is coupled with TniQ-dependent nucleation of a TnsC filament. In vivo transposition assays corroborate our structural findings, and biochemical and functional analyses of S15 supports its role as a bona fide component of the type V crRNA-guided transposition machinery. Altogether, our work uncovers key aspects of the mechanisms underpinning RNA-mediated assembly of CRISPR-associated transposons that will guide their development as programmable site- specific gene insertion tools.
Background Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. Findings Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20–30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. Conclusions The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity.
Cassava mosaic disease (CMD) suppresses cassava yields across the tropics. The dominant CMD2 locus confers resistance to cassava mosaic geminiviruses. It has been reported that CMD2-type landraces lose resistance after regeneration through de novo morphogenesis. As full genome bisulfite sequencing failed to uncover an epigenetic mechanism for this loss of resistance, whole genome sequencing and genetic variant analysis was performed and the CMD2 locus was fine-mapped to a 190 kilobase interval. Collectively, these data indicate that CMD2-type resistance is caused by a nonsynonymous, single nucleotide polymorphism in DNA polymerase δ subunit 1 (MePOLD1) located within this region. Virus-induced gene silencing of MePOLD1 in a CMD-susceptible cassava variety produced a recovery phenotype typical of CMD2-type resistance. Analysis of other CMD2-type cassava varieties identified additional candidate resistance alleles within MePOLD1. Genetic variation of MePOLD1, therefore, could represent an important genetic resource for resistance breeding and/or genome editing, and elucidating mechanisms of resistance to geminiviruses.
CRISPR-Cas12a is a powerful RNA-guided genome-editing system, also emerging as a robust diagnostic tool that cleaves double-stranded DNA using only the RuvC domain. This opens an overarching question on how the spatially distant DNA target strand (TS) traverses toward the RuvC catalytic core. Here, continuous tens of microsecond-long molecular dynamics and free- energy simulations reveal that an ⍺-helical lid, located within the RuvC domain, plays a pivotal role in the traversal of the TS by anchoring the crRNA:TS hybrid and elegantly guiding the TS toward the RuvC core, as also corroborated by DNA cleavage experiments. In this mechanism, the REC2 domain pushes the crRNA:TS hybrid toward the core of the enzyme, while the Nuc domain aids the bending and accommodation of the TS within the RuvC core by bending inward. Understanding of this cardinal process in the functioning of Cas12a will enrich fundamental knowledge and facilitate further engineering strategies for genome-editing.
Although the canonical function of CRISPR-Cas systems is to provide adaptive immunity against mobile genetic elements, type I-F, I-B and V-K systems have been adopted by Tn7-like transposons to direct RNA-guided transposon insertion. Type V-K CRISPR-associated transposons rely on the activities of the pseudonuclease Cas12k, the transposase TnsB, the AAA+ ATPase TnsC and the zinc-finger protein TniQ. However, the molecular and structural details of RNA-directed DNA transposition have remained elusive. Here we report cryo-electron microscopic structures of a Cas12k-guide RNA-target DNA complex and a DNA-bound, polymeric TnsC filament. The Cas12k complex structure reveals an intricate guide RNA architecture and critical interactions mediating RNA-guided target DNA recognition. The assembly of the TnsC helical filament is ATP-dependent and accompanied by structural remodeling of the bound DNA duplex. In vivo transposition assays corroborate key features of the structures, and biochemical experiments further show that TniQ restricts TnsC polymerization, while the TnsB transposase interacts directly with TnsC filaments to trigger their disassembly upon ATP hydrolysis. Together, these results suggest a mechanistic model whereby RNA-directed target selection by Cas12k primes TnsC polymerization and DNA remodeling, generating a recruitment platform for TnsB to catalyze site-specific transposon insertion. The present work advances our mechanistic understanding of the cross-talk between CRISPR effectors and the transposition machinery and will inform design efforts to harness CRISPR-associated transposons as programmable site-specific gene insertion tools for genome engineering applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.