Small RNA research is a rapidly growing field. Apart from microRNAs, which are important regulators of gene expression, other types of functional small RNA molecules have been reported in animals and plants. MicroRNAs are important in host-microbe interactions and parasite microRNAs might modulate the innate immunity of the host. Furthermore, small RNAs can be detected in bodily fluids making them attractive non-invasive biomarker candidates. Given the general broad interest in small RNAs, and in particular microRNAs, a large number of bioinformatics aided analysis types are needed by the scientific community. To facilitate integrated sRNA research, we developed sRNAtoolbox, a set of independent but interconnected tools for expression profiling from high-throughput sequencing data, consensus differential expression, target gene prediction, visual exploration in a genome context as a function of read length, gene list analysis and blast search of unmapped reads. All tools can be used independently or for the exploration and downstream analysis of sRNAbench results. Workflows like the prediction of consensus target genes of parasite microRNAs in the host followed by the detection of enriched pathways can be easily established. The web-interface interconnecting all these tools is available at http://bioinfo5.ugr.es/srnatoolbox
Since the original publication of sRNAtoolbox in 2015, small RNA research experienced notable advances in different directions. New protocols for small RNA sequencing have become available to address important issues such as adapter ligation bias, PCR amplification artefacts or to include internal controls such as spike-in sequences. New microRNA reference databases were developed with different foci, either prioritizing accuracy (low number of false positives) or completeness (low number of false negatives). Additionally, other small RNA molecules as well as microRNA sequence and length variants (isomiRs) have continued to gain importance. Finally, the number of microRNA sequencing studies deposited in GEO nearly triplicated from 2014 (280) to 2018 (764). These developments imply that fast and easy-to-use tools for expression profiling and subsequent downstream analysis of miRNA-seq data are essential to many researchers. Key features in this sRNAtoolbox release include addition of all major RNA library preparation protocols to sRNAbench and improvements in sRNAde, a tool that summarizes several aspects of small RNA sequencing studies including the detection of consensus differential expression. A special emphasis was put on the user-friendliness of the tools, for instance sRNAbench now supports parallel launching of several jobs to improve reproducibility and user time efficiency.
The NCBI Sequence Read Archive currently hosts microRNA sequencing data for over 800 different species, evidencing the existence of a broad taxonomic distribution in the field of small RNA research. Simultaneously, the number of samples per miRNA-seq study continues to increase resulting in a vast amount of data that requires accurate, fast and user-friendly analysis methods. Since the previous release of sRNAtoolbox in 2019, 55 000 sRNAbench jobs have been submitted which has motivated many improvements in its usability and the scope of the underlying annotation database. With this update, users can upload an unlimited number of samples or import them from Google Drive, Dropbox or URLs. Micro- and small RNA profiling can now be carried out using high-confidence Metazoan and plant specific databases, MirGeneDB and PmiREN respectively, together with genome assemblies and libraries from 441 Ensembl species. The new results page includes straightforward sample annotation to allow downstream differential expression analysis with sRNAde. Unassigned reads can also be explored by means of a new tool that performs mapping to microbial references, which can reveal contamination events or biologically meaningful findings as we describe in the example. sRNAtoolbox is available at: https://arn.ugr.es/srnatoolbox/.
Progressive evolution, or the tendency towards increasing complexity, is a controversial issue in biology, which resolution entails a proper measurement of complexity. Genomes are the best entities to address this challenge, as they encode the historical information of a species’ biotic and environmental interactions. As a case study, we have measured genome sequence complexity in the ancient phylum Cyanobacteria. To arrive at an appropriate measure of genome sequence complexity, we have chosen metrics that do not decipher biological functionality but that show strong phylogenetic signal. Using a ridge regression of those metrics against root-to-tip distance, we detected positive trends towards higher complexity in three of them. Lastly, we applied three standard tests to detect if progressive evolution is passive or driven—the minimum, ancestor–descendant, and sub-clade tests. These results provide evidence for driven progressive evolution at the genome-level in the phylum Cyanobacteria.
Transfer RNAs (tRNAs) are small adaptor RNAs essential for mRNA translation. Alterations in the cellular tRNA population can directly affect mRNA decoding rates and translational efficiency during cancer development and progression. To evaluate changes in the composition of the tRNA pool, multiple sequencing approaches have been developed to overcome reverse transcription blocks caused by the stable structures of these molecules and their numerous base modifications. However, it remains unclear whether current sequencing protocols faithfully capture tRNAs existing in cells or tissues. This is specifically challenging for clinical tissue samples that often present variable RNA qualities. For this reason, we developed ALL-tRNAseq, which combines the highly processive MarathonRT and RNA demethylation for the robust assessment of tRNA expression, together with a randomized adapter ligation strategy prior to reverse transcription to assess tRNA fragmentation levels in both cell lines and tissues. Incorporation of tRNA fragments not only informed on sample integrity but also significantly improved tRNA profiling of tissue samples. Our data showed that our profiling strategy effectively improves classification of oncogenic signatures in glioblastoma and diffuse large B-cell lymphoma tissues, particularly for samples presenting higher levels of RNA fragmentation, further highlighting the utility of ALL-tRNAseq for translational research.
Although miRNA-seq is extensively used in many different fields, its quality control is frequently restricted to a PhredScore-based filter. Other important quality related aspects like microRNA yield, the fraction of putative degradation products (such as rRNA fragments) or the percentage of adapter-dimers are hard to assess using absolute thresholds. Here we present mirnaQC, a webserver that relies on 34 quality parameters to assist in miRNA-seq quality control. To improve their interpretability, quality attributes are ranked using a reference distribution obtained from over 36 000 publicly available miRNA-seq datasets. Accepted input formats include FASTQ and SRA accessions. The results page contains several sections that deal with putative technical artefacts related to library preparation, sequencing, contamination or yield. Different visualisations, including PCA and heatmaps, are available to help users identify underlying issues. Finally, we show the usefulness of this approach by analysing two publicly available datasets and discussing the different quality issues that can be detected using mirnaQC.
Terminal nucleotidyl transferases are enzymes that add non-templated nucleotides to RNA molecules. In the case of microRNAs, this process was shown to be functionally relevant for their maturation process and generation of isomiRs with non-canonical mRNA targets. Deconvolution of these posttranscriptional modifications is challenging in particular for extracellular miRNAs that are considered as a target for minimally-invasive diagnostics. Massively parallel RNA sequencing is the only method that can truthfully reveal isomiR diversity in biological samples and determine relative quantities. Improvements aside, current small RNA sequencing strategies remain imprecise. We developed IsoSeek that diverges from these methods by making use of randomized 5’- and 3’-adapters combined with a 10N unique molecular identifier (UMI). Using synthetic miRNA and isomiR spike-in sets and testing depletion and RNA competition strategies in 7 sequencing rounds of >100 samples, we rigorously optimized and validated the technical accuracy of the IsoSeek method. In genetically-altered HEK293, we characterized the terminal uridylase (TUT4/TUT7) dependent miRNA uridylome and discovered extensive uridylation of disease-associated miRNAs. Notably, 3’-uridylated isomiR profiles of plasma extracellular vesicles (EVs) rely on UMI-correction. Thus, IsoSeek advances our knowledge of cell-free miRNAs and supports development into non-invasive biomarkers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.