MicroRNAs (miRNAs) are small non-coding RNAs involved in post-transcriptional gene regulation that have a major impact on many diseases and provides an exciting avenue towards antiviral therapeutics. From patient transcriptomic data, we determined a circulating miRNA, miR-2392, is directly involved with SARS-CoV-2 machinery during host infection. Specifically, we show that miR-2392 is key in driving downstream suppression of mitochondrial gene expression, increasing inflammation, glycolysis, and hypoxia as well as promoting many symptoms associated with COVID-19 infection. We demonstrate miR-2392 is present in the blood and urine of patients positive for COVID-19, but not present in patients negative for COVID-19. These findings indicate the potential for developing a minimally invasive COVID-19 detection method. Lastly, using in vitro human and in vivo hamster models, we design a miRNA-based antiviral therapeutic that targets miR-2392, significantly reduces SARS-CoV-2 viability in hamsters and may potentially inhibit a COVID-19 disease state in humans.
Long non-coding RNAs (lncRNAs) make up a significant portion of non-coding RNAs and are involved in a variety of biological processes. Accurate identification/annotation of lncRNAs is the primary step for gaining deeper insights into their functions. In this study, we report a novel tool, PLncPRO, for prediction of lncRNAs in plants using transcriptome data. PLncPRO is based on machine learning and uses random forest algorithm to classify coding and long non-coding transcripts. PLncPRO has better prediction accuracy as compared to other existing tools and is particularly well-suited for plants. We developed consensus models for dicots and monocots to facilitate prediction of lncRNAs in non-model/orphan plants. The performance of PLncPRO was quite better with vertebrate transcriptome data as well. Using PLncPRO, we discovered 3714 and 3457 high-confidence lncRNAs in rice and chickpea, respectively, under drought or salinity stress conditions. We investigated different characteristics and differential expression under drought/salinity stress conditions, and validated lncRNAs via RT-qPCR. Overall, we developed a new tool for the prediction of lncRNAs in plants and showed its utility via identification of lncRNAs in rice and chickpea.
Motivation The goal of phylostratigraphy is to infer the evolutionary origin of each gene in an organism. This is done by searching for homologs within increasingly broad clades. The deepest clade that contains a homolog of the protein(s) encoded by a gene is that gene’s phylostratum. Results We have created a general R-based framework, phylostratr, to estimate the phylostratum of every gene in a species. The program fully automates analysis: selecting species for balanced representation, retrieving sequences, building databases, inferring phylostrata and returning diagnostics. Key diagnostics include: detection of genes with inferred homologs in old clades, but not intermediate ones; proteome quality assessments; false-positive diagnostics, and checks for missing organellar genomes. phylostratr allows extensive customization and systematic comparisons of the influence of analysis parameters or genomes on phylostrata inference. A user may: modify the automatically generated clade tree or use their own tree; provide custom sequences in place of those automatically retrieved from UniProt; replace BLAST with an alternative algorithm; or tailor the method and sensitivity of the homology inference classifier. We show the utility of phylostratr through case studies in Arabidopsis thaliana and Saccharomyces cerevisiae. Availability and implementation Source code available at https://github.com/arendsee/phylostratr. Supplementary information Supplementary data are available at Bioinformatics online.
The molecular mechanisms underlying the clinical manifestations of COVID-19 and what distinguishes them from common seasonal influenza virus and other lung injury states such as Acute Respiratory Distress Syndrome, remains poorly understood. To address these challenges, we combine transcriptional profiling of 646 clinical nasopharyngeal swabs and 39 patient autopsy tissues to define body-wide transcriptome changes in response to COVID-19. We then match this data with spatial protein and expression profiling across 357 tissue sections from 16 representative patient lung samples and identify tissue compartment-specific damage wrought by SARS-CoV-2 infection, evident as a function of varying viral loads during the clinical course of infection and tissue type specific expression states. Overall, our findings reveal a systemic disruption of canonical cellular and transcriptional pathways across all tissues, which can inform subsequent studies to combat the mortality of COVID-19 and to better understand the molecular dynamics of lethal SARS-CoV-2 and other respiratory infections.
Unlike dicots, the robust root system in grass species largely originates from stem base during postembryonic development. The mechanisms by which plant hormone signaling pathways control the architecture of adventitious root remain largely unknown. Here, we studied the modulations in global genes activity in developing rice adventitious root by genome-wide RNA sequencing in response to external auxin and cytokinin signaling cues. We further analyzed spatiotemporal regulations of key developmental regulators emerged from our global transcriptome analysis. Interestingly, some of the key cell fate determinants such as homeodomain transcription factor (TF), OsHOX12, no apical meristem protein, OsNAC39, APETALA2/ethylene response factor, OsAP2/ERF-40 and WUSCHEL-related homeobox, OsWOX6.1 and OsWOX6.2, specifically expressed in adventitious root primordia. Functional analysis of one of these regulators, an auxin-induced TF containing AP2/ERF domain, OsAP2/ERF-40, demonstrates its sufficiency to confer the adventitious root fate. The ability to trigger the root developmental program is largely attributed to OsAP2/ERF-40-mediated dose-dependent transcriptional activation of genes that can facilitate generating effective auxin response, and OsERF3–OsWOX11–OsRR2 pathway. Our studies reveal gene regulatory network operating in response to hormone signaling pathways and identify a novel TF regulating adventitious root developmental program, a key agronomically important quantitative trait, upstream of OsERF3–OsWOX11–OsRR2 pathway.
The diverse and growing omics data in public domains provide researchers with tremendous opportunity to extract hidden, yet undiscovered, knowledge. However, the vast majority of archived data remain unused. Here, we present MetaOmGraph (MOG), a free, open-source, standalone software for exploratory analysis of massive datasets. Researchers, without coding, can interactively visualize and evaluate data in the context of its metadata, honing-in on groups of samples or genes based on attributes such as expression values, statistical associations, metadata terms and ontology annotations. Interaction with data is easy via interactive visualizations such as line charts, box plots, scatter plots, histograms and volcano plots. Statistical analyses include co-expression analysis, differential expression analysis and differential correlation analysis, with significance tests. Researchers can send data subsets to R for additional analyses. Multithreading and indexing enable efficient big data analysis. A researcher can create new MOG projects from any numerical data; or explore an existing MOG project. MOG projects, with history of explorations, can be saved and shared. We illustrate MOG by case studies of large curated datasets from human cancer RNA-Seq, where we identify novel putative biomarker genes in different tumors, and microarray and metabolomics data from Arabidopsis thaliana. MOG executable and code: http://metnetweb.gdcb.iastate.edu/ and https://github.com/urmi-21/MetaOmGraph/.
Analysis of yeast, fly and human genomes suggests that sequence divergence is not the main source of orphan genes.
Summary Searching for open reading frames is a routine task and a critical step prior to annotating protein coding regions in newly sequenced genomes or de novo transcriptome assemblies. With the tremendous increase in genomic and transcriptomic data, faster tools are needed to handle large input datasets. These tools should be versatile enough to fine-tune search criteria and allow efficient downstream analysis. Here we present a new python based tool, orfipy, which allows the user to flexibly search for open reading frames in genomic and transcriptomic sequences. The search is rapid and is fully customizable, with a choice of FASTA and BED output formats. Availability and implementation orfipy is implemented in python and is compatible with python v3.6 and higher. Source code: https://github.com/urmi-21/orfipy. Installation: from the source, or via PyPi (https://pypi.org/project/orfipy) or bioconda (https://anaconda.org/bioconda/orfipy). Supplementary information Supplementary data are available at https://github.com/urmi-21/orfipy and Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.