High-throughput RNA sequencing offers unprecedented opportunities to explore the Earth RNA virome. Mining 5,150 diverse metatranscriptomes uncovered >2.5 million RNA viral contigs. Via analysis of the 330k novel RNA-dependent RNA polymerases (RdRP), this expansion corresponds to a five-fold increase of RNA virus diversity. Extended RdRP phylogeny supports monophyly of the five established phyla, reveals two putative new bacteriophage phyla and numerous putative novel classes and orders. The dramatically expanded Lenarviricota phylum, consisting of bacterial and related eukaryotic viruses, now accounts for a third of the RNA virome diversity. Identification of CRISPR spacer matches and bacteriolytic proteins suggests that subsets of picobirnaviruses and partitiviruses, previously associated with eukaryotes, infect prokaryotic hosts. Gene content analysis revealed multiple domains previously not found in RNA viruses and implicated in virus-host interactions. This vast collection of new RNA virus genomes provides insights into RNA virus evolution and should become a major resource for RNA virology.
Machine learning is a modern approach to problem-solving and task automation. In particular, machine learning is concerned with the development and applications of algorithms that PLOS COMPUTATIONAL BIOLOGY
Objective To describe the development of an evaluation framework that allows quantification of surveillance functions and subsequent aggregation towards an overall score for biosurveillance system performance. Introduction Evaluation and strengthening of biosurveillance systems is a complex process that involves sequential decision steps, numerous stakeholders, and requires accommodating multiple and conflicting objectives. Biosurveillance evaluation, the initiating step towards biosurveillance strengthening, is a multi-dimensional decision problem that can be properly addressed via multi-criteria-decision models. Existing evaluation frameworks tend to focus on "hard" technical attributes (e.g. sensitivity) while ignoring other "soft" criteria (e.g. transparency) of difficult measurement and aggregation. As a result, biosurveillance value, a multi-dimensional entity, is not properly defined or assessed. Not addressing the entire range of criteria leads to partial evaluations that may fail to convene sufficient support across the stakeholders' base for biosurveillance improvements. We seek to develop a generic and flexible evaluation framework capable of integrating the multiple and conflicting criteria and values of different stakeholders, and which is sufficiently tractable to allow quantification of the value of specific biosurveillance projects towards the overall performance of biosurveillance systems.
We introduce ViroidDB, a value-added database that attempts to collect all known viroid and viroid-like circular RNA sequences into a single resource. Spanning about 10 000 unique sequences, ViroidDB includes viroids, retroviroid-like elements, small circular satellite RNAs, ribozyviruses, and retrozymes. Each sequence's secondary structure, ribozyme content, and cluster membership are predicted via a custom pipeline optimized for handling circular RNAs. The data can be explored via a purpose-built user interface that features visualizations, multiple sequence alignments, and a portal for downloading bulk data. Users can browse the data by sequence type, taxon, or typo-tolerant search of metadata fields. The database is freely accessible at https://viroids.org.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.