Key Points Aberrations in genome maintenance and DNA repair genes including POT1 occur at a high frequency in Sézary syndrome. Candidate driver genes and affected pathways in Sézary syndrome show extensive heterogeneity but overlap with other mature T-cell lymphomas.
RNA sequencing using next-generation sequencing technologies (NGS) is currently the standard approach for gene expression profiling, particularly for large-scale highthroughput studies. NGS technologies comprise high throughput, cost efficient shortread RNA-Seq, while emerging single molecule, long-read RNA-Seq technologies have enabled new approaches to study the transcriptome and its function. The emerging single molecule, long-read technologies are currently commercially available by Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT), while new methodologies based on short-read sequencing approaches are also being developed in order to provide long range single molecule level information-for example, the ones represented by the 10x Genomics linked read methodology. The shift toward longread sequencing technologies for transcriptome characterization is based on current increases in throughput and decreases in cost, making these attractive for de novo transcriptome assembly, isoform expression quantification, and in-depth RNA species analysis. These types of analyses were challenging with standard short sequencing approaches, due to the complex nature of the transcriptome, which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes, as well as the high sequence similarity of highly abundant species of RNA, such as rRNAs. Here we aim to focus on single molecule level sequencing technologies and single-cell technologies that, combined with perturbation tools, allow the analysis of complete RNA species, whether short or long, at high resolution. In parallel, these tools have opened new ways in understanding gene functions at the tissue, network, and pathway levels, as well as their detailed functional characterization. Analysis of the epi-transcriptome, including RNA methylation and modification and the effects of such modifications on biological systems is now enabled through direct RNA sequencing instead of classical indirect approaches. However, many difficulties and challenges remain, such as methodologies to generate full-length RNA or cDNA libraries from all different species of RNAs, not only poly-A containing transcripts, and the identification of allele-specific transcripts due to current error rates of single molecule technologies, while the bioinformatics analysis on long-read data for accurate identification of 5 and 3 UTRs is still in development.
The transcriptome encompasses a range of species including messenger RNA, and other noncoding RNA such as rRNA, tRNA, and short and long noncoding RNAs. Due to the huge role played by mRNA in development and disease, several methods have been developed to sequence and characterize mRNA, with RNA sequencing (RNA-Seq) emerging as the current method of choice particularly for large high-throughput studies. Short-read RNA-Seq which involves sequencing of short cDNA fragments and computationally assembling them to reconstruct the transcriptome, or aligning them to a reference is the most widely used approach. However, due to inherent limitations of this approach in de novo transcriptome assembly and isoform quantification, long-read RNA-Seq approaches, which also happen to be single molecule sequencing approaches, are increasingly becoming the standard for de novo transcriptome assembly and isoform quantification. In this chapter, we review the technical aspects of the current methods of RNA-Seq, both short and long-read approaches, and data analysis methods available. We discuss recent advances in single-cell RNA-Seq and direct RNA-Seq approaches, which perhaps will dominate the future of RNA-Seq.
RNA sequencing using next-generation sequencing (NGS, RNA-Seq) technologies is currently the standard approach for gene expression profiling, particularly for large-scale high-throughput studies. NGS technologies comprise short-read RNA-Seq (dominated by Illumina) and long-read RNA-Seq technologies provided by Pacific Bioscience (PacBio) and Oxford Nanopore Technologies (ONT). Although short-read sequencing technologies are the most widely used, long-read technologies are increasingly becoming the standard approach for de novo transcriptome assembly and isoform expression quantification due to the complex nature of the transcriptome which consists of variable lengths of transcripts and multiple alternatively spliced isoforms for most genes. In this chapter, we describe experimental procedures for library preparation, sequencing, and associated data analysis approaches for PacBio and ONT with a major focus on full length cDNA synthesis, de novo transcriptome assembly, and isoform quantification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.