CoRegNet is an R/Bioconductor package to analyze large-scale transcriptomic data by highlighting sets of co-regulators. Based on a transcriptomic dataset, CoRegNet can be used to: reconstruct a large-scale co-regulatory network, integrate regulation evidences such as transcription factor binding sites and ChIP data, estimate sample-specific regulator activity, identify cooperative transcription factors and analyze the sample-specific combinations of active regulators through an interactive visualization tool. In this study CoRegNet was used to identify driver regulators of bladder cancer.Availability: CoRegNet is available at http://bioconductor.org/packages/CoRegNetContact: remy.nicolle@issb.genopole.fr or mohamed.elati@issb.genopole.frSupplementary information: Supplementary data are available at Bioinformatics online.
Supplementary data are available at Bioinformatics online.
The mechanisms underlying the heterogeneity of clinical malaria remain largely unknown. We hypothesized that differential gene expression contributes to phenotypic variation of parasites which results in a specific interaction with the host, leading to different clinical features of malaria. In this study, we analyzed the transcriptomes of isolates obtained from asymptomatic carriers and patients with uncomplicated or cerebral malaria. We also investigated the transcriptomes of 3D7 clone and 3D7-Lib that expresses severe malaria associated-variant surface antigen. Our findings revealed a specific up-regulation of genes involved in pathogenesis, adhesion to host cell, and erythrocyte aggregation in parasites from patients with cerebral malaria and 3D7-Lib, compared to parasites from asymptomatic carriers and 3D7, respectively. However, we did not find any significant difference between the transcriptomes of parasites from cerebral malaria and uncomplicated malaria, suggesting similar transcriptomic pattern in these two parasite populations. The difference between isolates from asymptomatic children and cerebral malaria concerned genes coding for exported proteins, Maurer's cleft proteins, transcriptional factor proteins, proteins implicated in protein transport, as well as Plasmodium conserved and hypothetical proteins. Interestingly, UPs A1, A2, A3 and UPs B1 of var genes were predominantly found in cerebral malaria-associated isolates and those containing architectural domains of DC4, DC5, DC13 and their neighboring rif genes in 3D7-lib. Therefore, more investigations are needed to analyze the effective role of these genes during malaria infection to provide with new knowledge on malaria pathology. In addition, concomitant regulation of genes within the chromosomal neighborhood suggests a common mechanism of gene regulation in P. falciparum.
One of the most challenging tasks in modern science is the development of systems biology models: Existing models are often very complex but generally have low predictive performance. The construction of high-fidelity models will require hundreds/thousands of cycles of model improvement, yet few current systems biology research studies complete even a single cycle. We combined multiple software tools with integrated laboratory robotics to execute three cycles of model improvement of the prototypical eukaryotic cellular transformation, the yeast (Saccharomyces cerevisiae) diauxic shift. In the first cycle, a model outperforming the best previous diauxic shift model was developed using bioinformatic and systems biology tools. In the second cycle, the model was further improved using automatically planned experiments. In the third cycle, hypothesis-led experiments improved the model to a greater extent than achieved using high-throughput experiments. All of the experiments were formalized and communicated to a cloud laboratory automation system (Eve) for automatic execution, and the results stored on the semantic web for reuse. The final model adds a substantial amount of knowledge about the yeast diauxic shift: 92 genes (+45%), and 1,048 interactions (+147%). This knowledge is also relevant to understanding cancer, the immune system, and aging. We conclude that systems biology software tools can be combined and integrated with laboratory robots in closed-loop cycles.
BackgroundGenome-scale metabolic models provide an opportunity for rational approaches to studies of the different reactions taking place inside the cell. The integration of these models with gene regulatory networks is a hot topic in systems biology. The methods developed to date focus mostly on resolving the metabolic elements and use fairly straightforward approaches to assess the impact of genome expression on the metabolic phenotype.ResultsWe present here a method for integrating the reverse engineering of gene regulatory networks into these metabolic models. We applied our method to a high-dimensional gene expression data set to infer a background gene regulatory network. We then compared the resulting phenotype simulations with those obtained by other relevant methods.ConclusionsOur method outperformed the other approaches tested and was more robust to noise. We also illustrate the utility of this method for studies of a complex biological phenomenon, the diauxic shift in yeast.
Complex phenotypes, such as lipid accumulation, result from cooperativity between regulators and the integration of multiscale information. However, the elucidation of such regulatory programs by experimental approaches may be challenging, particularly in context-specific conditions. In particular, we know very little about the regulators of lipid accumulation in the oleaginous yeast of industrial interest Yarrowia lipolytica. This lack of knowledge limits the development of this yeast as an industrial platform, due to the time-consuming and costly laboratory efforts required to design strains with the desired phenotypes. In this study, we aimed to identify context-specific regulators and mechanisms, to guide explorations of the regulation of lipid accumulation in Y. lipolytica. Using gene regulatory network inference, and considering the expression of 6539 genes over 26 time points from GSE35447 for biolipid production and a list of 151 transcription factors, we reconstructed a gene regulatory network comprising 111 transcription factors, 4451 target genes and 17048 regulatory interactions (YL-GRN-1) supported by evidence of protein–protein interactions. This study, based on network interrogation and wet laboratory validation (a) highlights the relevance of our proposed measure, the transcription factors influence, for identifying phases corresponding to changes in physiological state without prior knowledge (b) suggests new potential regulators and drivers of lipid accumulation and (c) experimentally validates the impact of six of the nine regulators identified on lipid accumulation, with variations in lipid content from +43.2% to −31.2% on glucose or glycerol.
Reconstruction of large scale gene regulatory networks (GRNs in the following) is an important step for understanding the complex regulatory mechanisms within the cell. Many modeling approaches have been introduced to find the causal relationship between genes using expression data. However, they have been suffering from high dimensionality-large number of genes but a small number of samples, overfitting, heavy computation time and low interpretability. We have previously proposed an original Data Mining algorithm Licorn, that infers cooperative regulation network from expression datasets. In this work, we present an extension of Licorn to a hybrid inference method h-Licorn that uses search in both discrete and real valued spaces. Licorn's algorithm, using the discrete space to find cooperative regulation relationships fitting the target gene expression, has been shown to be powerful in identifying cooperative regulation relationships that are out of the scope of most GRN inference methods. Still, as many of related GRN inference techniques, Licorn suffers from a large number of false positives. We propose here an extension of Licorn with a numerical selection step, expressed as a linear regression problem, that effectively complements the discrete search of Licorn. We evaluate a bootstrapped version of h-Licorn on the in silico Dream5 dataset and show that h-Licorn has significantly higher performance than Licorn, and is competitive or outperforms state of the art GRN inference algorithms, especially when operating on small data sets. We also applied h-Licorn on a real dataset of human bladder cancer and show that it performs better than other methods in finding candidate regulatory interactions. In particular, solely based on gene expression data, h-Licorn is able to identify experimentally validated regulator cooperative relationships involved in cancer.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.