RNA-protein interactions have critical roles in gene regulation. However, high-throughput methods to quantitatively analyze these interactions are lacking. We adapted an Illumina GAIIx sequencer to make several million such measurements with a High-Throughput Sequencing – RNA Affinity Profiling (HiTS-RAP) assay. Millions of cDNAs are sequenced, bound by the E. coli replication terminator protein Tus, and transcribed in situ, whereupon Tus halts transcription leaving RNA stably attached to its template DNA. The binding of fluorescently-labeled protein is then quantified in the sequencer. We used HiTS-RAP to measure the affinity of mutagenized libraries of GFP-binding and NELF-E binding aptamers to their respective targets and thereby identified regions in both aptamers that are critical for their RNA-protein interaction. We show that mutations additively affect binding affinity of the NELF-E binding aptamer, whose interaction depends mainly on a single-stranded RNA motif, but not that of the GFP aptamer, whose interaction depends primarily on secondary structure.
Eukaryotic RNA Polymerase II has been found at both promoters and distal enhancers, suggesting additional functions beyond mRNA production. To understand this role, we sequenced nascent RNAs at single-molecule resolution to unravel the interplay between Pol2 initiation, capping, and pausing genome-wide. Our analyses reveal two pause classes that are associated with different RNA capping profiles. More proximal pausing is associated with less complete capping, less elongation, and a more enhancer-like complement of transcription factors than later pausing. Unexpectedly, Transcription Start Sites (TSSes) are predominantly found in constellations composed of multiple divergent pairs. TSS clusters are intimately associated with precise arrays of nucleosomes, and correspond with boundaries of transcription factor binding and chromatin modification at promoters and enhancers. TSS architecture is remarkably similar after the dramatic transcriptional changes caused by heat stress. Together, our results suggest that promoter- and enhancer-associated Pol2 is a regulatory nexus for integrating information across TSS ensembles.
24Tracking active transcription with the nuclear run-on (NRO) assays has been instrumental in 25 uncovering mechanisms of gene regulation. The coupling of NROs with high-throughput 26 sequencing has facilitated the discovery of previously unannotated or undetectable RNA classes 27 genome-wide. Precision run-on sequencing (PRO-seq) is a run-on variant that maps polymerase 28 active sites with nucleotide or near-nucleotide resolution. One main drawback to this and many 29 other nascent RNA detection methods is the somewhat intimidating multi-day workflow 30 associated with creating the libraries suitable for high-throughput sequencing. Here, we present 31an improved PRO-seq protocol where many of the enzymatic steps are carried out while the 32 biotinylated NRO RNA remains bound to streptavidin-coated magnetic beads. These 33 adaptations reduce time, sample loss and RNA degradation, and we demonstrate that the 34 resulting libraries are of the same quality as libraries generated using the original published 35protocol. The assay is also more sensitive which permits reproducible, high-quality libraries from 36 10 4 -10 5 cells instead of 10 6 -10 7 . Altogether, the improved protocol is more tractable allows for 37 nascent RNA profiling from small samples, such as rare samples or FACS sorted cell 38populations. 39 40
Because RNA-protein interactions play a central role in a wide-array of biological processes, methods that enable a quantitative assessment of these interactions in a high-throughput manner are in great demand. Recently, we developed the High Throughput Sequencing-RNA Affinity Profiling (HiTS-RAP) assay, which couples sequencing on an Illumina GAIIx with the quantitative assessment of one or several proteins’ interactions with millions of different RNAs in a single experiment. We have successfully used HiTS-RAP to analyze interactions of EGFP and NELF-E proteins with their corresponding canonical and mutant RNA aptamers. Here, we provide a detailed protocol for HiTS-RAP, which can be completed in about a month (8 days hands-on time) including the preparation and testing of recombinant proteins and DNA templates, clustering DNA templates on a flowcell, high-throughput sequencing and protein binding with GAIIx, and finally data analysis. We also highlight aspects of HiTS-RAP that can be further improved and points of comparison between HiTS-RAP and two other recently developed methods, RNA-MaP and RBNS. A successful HiTS-RAP experiment provides the sequence and binding curves for approximately 200 million RNAs in a single experiment.
Conventional methods for viral genome sequencing largely use metatranscriptomic approaches or, alternatively, enrich for viral genomes by amplicon sequencing with virus-specific PCR or hybridization-based capture. These existing methods are costly, require extensive sample handling time, and have limited throughput. Here, we describe V-seq, an inexpensive, fast, and scalable method that performs targeted viral genome sequencing by multiplexing virus-specific primers at the cDNA synthesis step. We designed densely tiled reverse transcription (RT) primers across the SARS-CoV-2 genome, with a subset of hexamers at the 3 prime end to minimize mis-priming from the abundant human rRNA repeats and human RNA PolII transcriptome. We found that overlapping RT primers do not interfere, but rather act in concert to improve viral genome coverage in samples with low viral load. We provide a path to optimize V-seq with SARS-CoV-2 as an example. We anticipate that V-seq can be applied to investigate genome evolution and track outbreaks of RNA viruses in a cost-effective manner. More broadly, the multiplexed RT approach by V-seq can be generalized to other applications of targeted RNA sequencing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.