Site-specific gene addition can allow stable transgene expression for gene therapy. When possible, this is preferred over the use of promiscuously integrating vectors, which are sometimes associated with clonal expansion1 and oncogenesis2. Site-specific endonucleases that can induce high rates of targeted genome editing are finding increasing applications in biological discovery and gene therapy3. However, two safety concerns persist: (1) endonuclease-associated adverse effects, both on4 and off-target5,6; and (2) oncogene activation caused by promoter integration, even without nucleases7. Here, we perform recombinant adeno-associated virus (rAAV) mediated promoterless gene targeting without nucleases and demonstrate amelioration of the bleeding diathesis in haemophilia B mice. In particular, we target a promoterless human coagulation factor IX (hF9) gene to the liver-expressed albumin (Alb) locus. hF9 is targeted, along with a preceding 2A-peptide coding sequence, to be integrated just upstream to the Alb stop codon. While hF9 is fused to Alb at the DNA and RNA levels, two separate proteins are synthesized by way of ribosomal skipping. Thus, hF9 expression is linked to robust hepatic albumin expression without disrupting it. We injected an AAV8-hF9 vector into neonatal and adult mice and achieved on-target integration into ~0.5% of the albumin alleles in hepatocytes. We established that hF9 was produced only from on-target integration, and ribosomal skipping was highly efficient. Stable hF9 plasma levels at 7–20% of normal were obtained, and treated factor IX deficient mice had normal coagulation times. In conclusion, transgene integration as a 2A-fusion to a highly expressed endogenous gene may obviate the requirement for nucleases and/or vector-borne promoters. This method may allow for safe and efficacious gene targeting in both infants and adults by greatly diminishing off-target effects while still providing therapeutic levels of expression from integration.
Multiplexed single-cell RNA-seq analysis of multiple samples using pooling is a promising experimental design, offering increased throughput while allowing to overcome batch variation. To reconstruct the sample identify of each cell, genetic variants that segregate between the samples in the pool have been proposed as natural barcode for cell demultiplexing. Existing demultiplexing strategies rely on availability of complete genotype data from the pooled samples, which limits the applicability of such methods, in particular when genetic variation is not the primary object of study. To address this, we here present Vireo, a computationally efficient Bayesian model to demultiplex single-cell data from pooled experimental designs. Uniquely, our model can be applied in settings when only partial or no genotype information is available. Using pools based on synthetic mixtures and results on real data, we demonstrate the robustness of Vireo and illustrate the utility of multiplexed experimental designs for common expression analyses.
The X‐linked genetic bleeding disorder caused by deficiency of coagulator factor IX, hemophilia B, is a disease ideally suited for gene therapy with genome editing technology. Here, we identify a family with hemophilia B carrying a novel mutation, Y371D, in the human F9 gene. The CRISPR/Cas9 system was used to generate distinct genetically modified mouse models and confirmed that the novel Y371D mutation resulted in a more severe hemophilia B phenotype than the previously identified Y371S mutation. To develop therapeutic strategies targeting this mutation, we subsequently compared naked DNA constructs versus adenoviral vectors to deliver Cas9 components targeting the F9 Y371D mutation in adult mice. After treatment, hemophilia B mice receiving naked DNA constructs exhibited correction of over 0.56% of F9 alleles in hepatocytes, which was sufficient to restore hemostasis. In contrast, the adenoviral delivery system resulted in a higher corrective efficiency but no therapeutic effects due to severe hepatic toxicity. Our studies suggest that CRISPR/Cas‐mediated in situ genome editing could be a feasible therapeutic strategy for human hereditary diseases, although an efficient and clinically relevant delivery system is required for further clinical studies.
Single-cell RNA-seq (scRNA-seq) provides a comprehensive measurement of stochasticity in transcription, but the limitations of the technology have prevented its application to dissect variability in RNA processing events such as splicing. Here, we present BRIE (Bayesian regression for isoform estimation), a Bayesian hierarchical model that resolves these problems by learning an informative prior distribution from sequence features. We show that BRIE yields reproducible estimates of exon inclusion ratios in single cells and provides an effective tool for differential isoform quantification between scRNA-seq data sets. BRIE, therefore, expands the scope of scRNA-seq experiments to probe the stochasticity of RNA processing.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-017-1248-5) contains supplementary material, which is available to authorized users.
BackgroundRNA levels detected at steady state are the consequence of multiple dynamic processes within the cell. In addition to synthesis and decay, transcripts undergo processing. Metabolic tagging with a nucleotide analog is one way of determining the relative contributions of synthesis, decay and conversion processes globally.ResultsBy improving 4-thiouracil labeling of RNA in Saccharomyces cerevisiae we were able to isolate RNA produced during as little as 1 minute, allowing the detection of nascent pervasive transcription. Nascent RNA labeled for 1.5, 2.5 or 5 minutes was isolated and analyzed by reverse transcriptase-quantitative polymerase chain reaction and RNA sequencing. High kinetic resolution enabled detection and analysis of short-lived non-coding RNAs as well as intron-containing pre-mRNAs in wild-type yeast. From these data we measured the relative stability of pre-mRNA species with different high turnover rates and investigated potential correlations with sequence features.ConclusionsOur analysis of non-coding RNAs reveals a highly significant association between non-coding RNA stability, transcript length and predicted secondary structure. Our quantitative analysis of the kinetics of pre-mRNA splicing in yeast reveals that ribosomal protein transcripts are more efficiently spliced if they contain intron secondary structures that are predicted to be less stable. These data, in combination with previous results, indicate that there is an optimal range of stability of intron secondary structures that allows for rapid splicing.Electronic supplementary materialThe online version of this article (doi:10.1186/s13059-015-0848-1) contains supplementary material, which is available to authorized users.
The functional consequences of alternative splicing on altering the transcription rate have been the subject of intensive study in mammalian cells but less is known about effects of splicing on changing the transcription rate in yeast. We present several lines of evidence showing that slow RNA polymerase II elongation increases both cotranscriptional splicing and splicing efficiency and that faster elongation reduces cotranscriptional splicing and splicing efficiency in budding yeast, suggesting that splicing is more efficient when cotranscriptional. Moreover, we demonstrate that altering the RNA polymerase II elongation rate in either direction compromises splicing fidelity, and we reveal that splicing fidelity depends largely on intron length together with secondary structure and splice site score. These effects are notably stronger for the highly expressed ribosomal protein coding transcripts. We propose that transcription by RNA polymerase II is tuned to optimize the efficiency and accuracy of ribosomal protein gene expression, while allowing flexibility in splice site choice with the nonribosomal protein transcripts.[Supplemental material is available for this article.]Splicing is the process of removing introns from precursor messenger RNAs (pre-mRNAs) and joining adjacent exons to produce spliced mRNA. The excised intron, in the form of a branched lariat, is rapidly debranched and discarded. If genes contain multiple introns, alternative splicing pathways can give rise to distinct mRNA and protein isoforms by using alternative splice sites or by including or excluding particular exons. In human cells, ∼95% of transcripts are alternatively spliced, thereby greatly expanding the coding capacity of the genome (Kornblihtt et al. 2013). Moreover, alternative splicing events that introduce translational stop codons in mRNAs are generally coupled with nonsensemediated decay (NMD) to down-regulate that spliced isoform, offering an additional layer of gene expression regulation (Lewis et al. 2003;Sayani et al. 2008;Kawashima et al. 2014). In Saccharomyces cerevisiae (budding yeast) only ∼5% of genes contain an intron, although they produce ∼27% of total mRNA, because many intron-containing genes are highly expressed (Ares et al. 1999). There is extensive evidence that in both metazoans and budding yeast, the process of splicing occurs as soon as the intron is transcribed and before transcription termination, that is, cotranscriptionally (Alexander et al. 2010b;Ameur et al. 2011;Carrillo Oesterreich et al. 2016). As a result of splicing being cotranscriptional, RNA polymerase II (RNAPII) elongation rate can influence splicing. According to one model, referred to as the "kinetic coupling" model, variations in RNAPII elongation rate can alter the time available, or the "window of opportunity" for upstream splice sites to be recognized before competing downstream splice sites are produced (Roberts et al. 1998;de la Mata et al. 2003;Kornblihtt 2007;Naftelberg et al. 2015;Saldi et al. 2016). Consistent with this model, ...
Key findings • A novel approach for integrating DNA-seq and single-cell RNA-seq data to reconstruct clonal substructure for single-cell transcriptomes. • Evidence for non-neutral evolution of clonal populations in human fibroblasts. • Proliferation and cell cycle pathways are commonly distorted in mutated clonal populations.
Summary Single-cell sequencing is an increasingly used technology and has promising applications in basic research and clinical translations. However, genotyping methods developed for bulk sequencing data have not been well adapted for single-cell data, in terms of both computational parallelization and simplified user interface. Here we introduce a software, cellsnp-lite, implemented in C/C ++ and based on well-supported package htslib, for genotyping in single-cell sequencing data for both droplet and well based platforms. On various experimental data sets, it shows substantial improvement in computational speed and memory efficiency with retaining highly concordant results compared to existing methods. Cellsnp-lite, therefore, lightens the genetic analysis for increasingly large single-cell data. Availability The source code is freely available at https://github.com/single-cell-genetics/cellsnp-lite. Supplementary information Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.