Graphical AbstractHighlights d exRNA sequencing complexity and reproducibility varies across isolation methods d Deconvolution shows differential access to exRNA carriers by different methods d Performance of exRNA isolation methods vary across biofluids and RNA species d miRDaR enables customized selection of optimal exRNA isolation methods SUMMARYPoor reproducibility within and across studies arising from lack of knowledge regarding the performance of extracellular RNA (exRNA) isolation methods has hindered progress in the exRNA field. A systematic comparison of 10 exRNA isolation methods across 5 biofluids revealed marked differences in the complexity and reproducibility of the resulting small RNA-seq profiles. The relative efficiency with which each method accessed different exRNA carrier subclasses was determined by estimating the proportions of extracellular vesicle (EV)-, ribonucleoprotein (RNP)-, and highdensity lipoprotein (HDL)-specific miRNA signatures in each profile. An interactive web-based application (miRDaR) was developed to help investigators select the optimal exRNA isolation method for their studies. miRDar provides comparative statistics for all expressed miRNAs or a selected subset of miRNAs in the desired biofluid for each exRNA isolation method and returns a ranked list of exRNA isolation methods prioritized by complexity, expression level, and repro-ducibility. These results will improve reproducibility and stimulate further progress in exRNA biomarker development.
Isoforms of human miRNAs (isomiRs) are constitutively expressed with tissue- and disease-subtype-dependencies. We studied 10 271 tumor datasets from The Cancer Genome Atlas (TCGA) to evaluate whether isomiRs can distinguish amongst 32 TCGA cancers. Unlike previous approaches, we built a classifier that relied solely on ‘binarized’ isomiR profiles: each isomiR is simply labeled as ‘present’ or ‘absent’. The resulting classifier successfully labeled tumor datasets with an average sensitivity of 90% and a false discovery rate (FDR) of 3%, surpassing the performance of expression-based classification. The classifier maintained its power even after a 15× reduction in the number of isomiRs that were used for training. Notably, the classifier could correctly predict the cancer type in non-TCGA datasets from diverse platforms. Our analysis revealed that the most discriminatory isomiRs happen to also be differentially expressed between normal tissue and cancer. Even so, we find that these highly discriminating isomiRs have not been attracting the most research attention in the literature. Given their ability to successfully classify datasets from 32 cancers, isomiRs and our resulting ‘Pan-cancer Atlas’ of isomiR expression could serve as a suitable framework to explore novel cancer biomarkers.
MINTbase is a repository that comprises nuclear and mitochondrial tRNA-derived fragments (‘tRFs’) found in multiple human tissues. The original version of MINTbase comprised tRFs obtained from 768 transcriptomic datasets. We used our deterministic and exhaustive tRF mining pipeline to process all of The Cancer Genome Atlas datasets (TCGA). We identified 23 413 tRFs with abundance of ≥ 1.0 reads-per-million (RPM). To facilitate further studies of tRFs by the community, we just released version 2.0 of MINTbase that contains information about 26 531 distinct human tRFs from 11 719 human datasets as of October 2017. Key new elements include: the ability to filter tRFs on-the-fly by minimum abundance thresholding; the ability to filter tRFs by tissue keywords; easy access to information about a tRF’s maximum abundance and the datasets that contain it; the ability to generate relative abundance plots for tRFs across cancer types and convert them into embeddable figures; MODOMICS information about modifications of the parental tRNA, etc. Version 2.0 of MINTbase contains 15x more datasets and nearly 4x more distinct tRFs than the original version, yet continues to offer fast, interactive access to its contents. Version 2.0 is available freely at http://cm.jefferson.edu/MINTbase/.
The fragments that derive from transfer RNAs (tRNAs) are an emerging category of regulatory RNAs. Known as tRFs, these fragments were reported for the first time only a decade ago, making them a relatively recent addition to the ever-expanding pantheon of non-coding RNAs. tRFs are short, 16–35 nucleotides (nts) in length, and produced through cleavage of mature and precursor tRNAs at various positions. Both cleavage positions and relative tRF abundance depend strongly on context, including the tissue type, tissue state, and disease, as well as the sex, population of origin, and race/ethnicity of an individual. These dependencies increase the urgency to understand the regulatory roles of tRFs. Such efforts are gaining momentum, and comprise experimental and computational approaches. System-level studies across many tissues and thousands of samples have produced strong evidence that tRFs have important and multi-faceted roles. Here, we review the relevant literature on tRF biology in higher organisms, single cell eukaryotes, and prokaryotes.
tRNA-derived fragments (tRF) are a class of potent regulatory RNAs. We mined the datasets from The Cancer Genome Atlas (TCGA) representing 32 cancer types with a deterministic and exhaustive pipeline for tRNA fragments. We found that mitochondrial tRNAs contribute disproportionally more tRFs than nuclear tRNAs. Through integrative analyses, we uncovered a multitude of statistically significant and contextdependent associations between the identified tRFs and mRNAs. In many of the 32 cancer types, these associations involve mRNAs from developmental processes, receptor tyrosine kinase signaling, the proteasome, and metabolic pathways that include glycolysis, oxidative phosphorylation, and ATP synthesis. Even though the pathways are common to multiple cancers, the association of specific mRNAs with tRFs depends on and differs from cancer to cancer. The associations between tRFs and mRNAs extend to genomic properties as well; specifically, tRFs are positively correlated with shorter genes that have a higher density in repeats, such as ALUs, MIRs, and ERVLs. Conversely, tRFs are negatively correlated with longer genes that have a lower repeat density, suggesting a possible dichotomy between cell proliferation and differentiation. Analyses of bladder, lung, and kidney cancer data indicate that the tRF-mRNA wiring can also depend on a patient's sex. Sex-dependent associations involve cyclindependent kinases in bladder cancer, the MAPK signaling pathway in lung cancer, and purine metabolism in kidney cancer. Taken together, these findings suggest diverse and wide-ranging roles for tRFs and highlight the extensive interconnections of tRFs with key cellular processes and human genomic architecture.Significance: Across 32 TCGA cancer contexts, nuclear and mitochondrial tRNA fragments exhibit associations with mRNAs that belong to concrete pathways, encode proteins with particular destinations, have a biased repeat content, and are sex dependent.
MicroRNA (miRNA) isoforms (“isomiRs”) and tRNA-derived fragments (“tRFs”) are powerful regulatory non-coding RNAs (ncRNAs). In human tissues, both types of molecules are abundant, with expression patterns that depend on a person’s race, sex and population origin. Here, we present our analyses of the Prostate Cancer (PRAD) datasets of The Cancer Genome Atlas (TCGA) from the standpoint of isomiRs and tRFs. This study represents the first simultaneous examination of isomiRs and tRFs in a large cohort of PRAD patients. We find that isomiRs and tRFs have extensive correlations with messenger RNAs (mRNAs). These correlations are disrupted in PRAD, which suggests disruptions of the regulatory network in the disease state. Notably, we find that the profiles of isomiRs and tRFs differ in patients belonging to different races. We hope that the presented findings can lay the groundwork for future research efforts aimed at elucidating the functional roles of the numerous and distinct members of these two categories of ncRNAs that are present in PRAD.
Background: The advent of next generation sequencing (NGS) has allowed the discovery of short and long noncoding RNAs (ncRNAs) in an unbiased manner using reverse genetics approaches, enabling the discovery of multiple categories of ncRNAs and characterization of the way their expression is regulated. We previously showed that the identities and abundances of microRNA isoforms (isomiRs) and transfer RNA-derived fragments (tRFs) are tightly regulated, and that they depend on a person's sex and population origin, as well as on tissue type, tissue state, and disease type. Here, we characterize the regulation and distribution of fragments derived from ribosomal RNAs (rRNAs). rRNAs form a group that includes four (5S, 5.8S, 18S, 28S) rRNAs encoded by the human nuclear genome and two (12S, 16S) by the mitochondrial genome. rRNAs constitute the most abundant RNA type in eukaryotic cells. Results: We analyzed rRNA-derived fragments (rRFs) across 434 transcriptomic datasets obtained from lymphoblastoid cell lines (LCLs) derived from healthy participants of the 1000 Genomes Project. The 434 datasets represent five human populations and both sexes. We examined each of the six rRNAs and their respective rRFs, and did so separately for each population and sex. Our analysis shows that all six rRNAs produce rRFs with unique identities, normalized abundances, and lengths. The rRFs arise from the 5′-end (5′-rRFs), the interior (i-rRFs), and the 3′-end (3′-rRFs) or straddle the 5′ or 3′ terminus of the parental rRNA (x-rRFs). Notably, a large number of rRFs are produced in a population-specific or sexspecific manner. Preliminary evidence suggests that rRF production is also tissue-dependent. Of note, we find that rRF production is not affected by the identity of the processing laboratory or the library preparation kit. Conclusions: Our findings suggest that rRFs are produced in a regimented manner by currently unknown processes that are influenced by both ubiquitous as well as population-specific and sex-specific factors. The properties of rRFs mirror the previously reported properties of isomiRs and tRFs and have implications for the study of homeostasis and disease.
We sought to determine whether commercial quantitative polymerase chain reaction (qPCR) methods are capable of distinguishing isomiRs: variants of mature microRNAs (miRNAs) with sequence endpoint differences. We used two commercially available miRNA qPCR methods to quantify miR-21-5p in both synthetic and real cell contexts. We find that although these miRNA qPCR methods possess high sensitivity for specific sequences, they also pick up background signals from closely related isomiRs, which influences the reliable quantification of individual isomiRs. We conclude that these methods do not possess the requisite specificity for reliable isomiR quantification.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.