BackgroundMetagenomics, based on culture-independent sequencing, is a well-fitted approach to provide insights into the composition, structure and dynamics of environmental viral communities. Following recent advances in sequencing technologies, new challenges arise for existing bioinformatic tools dedicated to viral metagenome (i.e. virome) analysis as (i) the number of viromes is rapidly growing and (ii) large genomic fragments can now be obtained by assembling the huge amount of sequence data generated for each metagenome.ResultsTo face these challenges, a new version of Metavir was developed. First, all Metavir tools have been adapted to support comparative analysis of viromes in order to improve the analysis of multiple datasets. In addition to the sequence comparison previously provided, viromes can now be compared through their k-mer frequencies, their taxonomic compositions, recruitment plots and phylogenetic trees containing sequences from different datasets. Second, a new section has been specifically designed to handle assembled viromes made of thousands of large genomic fragments (i.e. contigs). This section includes an annotation pipeline for uploaded viral contigs (gene prediction, similarity search against reference viral genomes and protein domains) and an extensive comparison between contigs and reference genomes. Contigs and their annotations can be explored on the website through specifically developed dynamic genomic maps and interactive networks.ConclusionsThe new features of Metavir 2 allow users to explore and analyze viromes composed of raw reads or assembled fragments through a set of adapted tools and a user-friendly interface.
Genomics experiments are widely acknowledged to produce a huge amount of data to be analysed. The challenge is to extract meaningful biological context for proteins or genes which is currently difficult because of the lack of an integrative workflow that hinders the efficiency and the robustness of data mining performed by biologists working on ruminants. Thus, we designed ProteINSIDE, a free web service (www.proteinside.org) that (I) provides an overview of the biological information stored in public databases or provided by annotations according to the Gene Ontology, (II) predicts proteins that are secreted to search for proteins that mediate signalisation between cells or tissues, and (III) analyses protein-protein interactions to identify proteins contributing to a process or to visualize functional pathways. Using lists of proteins or genes as a unique input, ProteINSIDE is an original all-in-one tool that merges data from these searches to present a fast overview and integrative analysis of genomic and proteomic data from Bovine, Ovine, Caprine, Human, Rat, and Murine species. ProteINSIDE was bench tested with 1000 proteins identifiers from each species by comparison with DAVID, BioMyn, AgBase, PrediSi, and Phobius. Compared to DAVID or BioMyn, identifications and annotations provided by ProteINSIDE were similar from monogastric proteins but more numerous and relevant for ruminants proteins. ProteINSIDE, thanks to SignalP, listed less proteins potentially secreted with a signal peptide than PrediSi and Phobius, in agreement with the low false positive rate of SignalP. In addition ProteINSIDE is the only resource that predicts proteins secreted by cellular processes that do not involve a signal peptide. Lastly, we reported the usefulness of ProteINSIDE to bring new biological hypotheses of research from proteomics data: the biological meaning of the uptake of adiponectin by the foetal muscle and a role for autophagy during ontogenesis of adipose and muscle tissues.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.