It is now established that the transcription factors E2A, EBF1 and Foxo1 play critical roles in B cell development. Here we show that E2A and EBF1 bound regulatory elements present in the Foxo1 locus. E2A and EBF1 as well as E2A and Foxo1, in turn, were wired together by a vast spectrum of cis-regulatory codes. These associations were dynamic during developmental progression. Occupancy by the E2A isoform, E47, directly elevated the abundance as well as the pattern of histone H3K4 monomethylation across putative enhancer regions. Finally, the pro-B cell epigenome was divided into clusters of loci that show E2A, EBF and Foxo1 occupancy. From this analysis a global network consisting of transcriptional regulators, signaling and survival factors, was constructed that we propose orchestrates the B cell fate.
Although cellular behaviors are dynamic, the networks that govern these behaviors have been mapped primarily as static snapshots. Using an approach called differential epistasis mapping, we have discovered widespread changes in genetic interaction among yeast kinases, phosphatases, and transcription factors as the cell responds to DNA damage. Differential interactions uncover many gene functions that go undetected in static conditions. They are very effective at identifying DNA repair pathways, highlighting new damage-dependent roles for the Slt2 kinase, Pph3 phosphatase, and histone variant Htz1. The data also reveal that protein complexes are generally stable in response to perturbation, but the functional relations between these complexes are substantially reorganized. Differential networks chart a new type of genetic landscape that is invaluable for mapping cellular responses to stimuli.One of the most basic approaches to understanding gene function relies on the identification of genetic interactions, which occur when the phenotypic effects of one gene depend on the presence of a second. Recently, a number of technologies have been developed to systematically map genetic interaction networks over large sets of genes in budding yeast (1-3) and other model organisms (4,5 To gain insight into how genetic networks are altered by stress, we assembled a large genetic interactome with and without perturbation by the DNA-damaging agent methyl methane-sulfonate (MMS). Using the technique of epistatic miniarray profiles (E-MAP) (8), genetic interactions were interrogated among a set of 418 yeast genes selected to provide broad coverage of the cellular signaling and transcriptional machinery, including nearly all yeast kinases, phosphatases, and transcription factors, as well as known DNA repair factors ( fig. S1 and table S1). About 80,000 double-mutant strains were generated from all pairwise mutant combinations of the 418 genes, in which mutations were complete gene deletions (nonessential genes) or hypomorphic alleles (essential genes) as appropriate. Double-mutant combinations were grown with or without 0.02% MMS, and their colony sizes were analyzed statistically to compute a genetic interaction score (S score) in each condition (9), which indicates whether the strain was healthier or sicker than expected (positive or negative S, respectively) (10).From established score thresholds for positive and negative interactions (S ≥ +2.0, S ≤ −2.5) (9) we identified two genetic networks: a set of 1905 interactions for the untreated condition, and a set of 2297 interactions under MMS. Analysis of these "static" genetic maps showed strong associations with physical interaction networks of various kinds. For example, gene pairs with either positive or negative genetic interactions were highly enriched for proteins known to physically interact. In addition, both maps were enriched for known kinase-and phosphatase-substrate pairs, as well as transcription factor-target pairs ( fig. S2). The correspondence to physical...
Ontologies have proven very useful for capturing knowledge as a hierarchy of terms and their interrelationships. In biology a major challenge has been to construct ontologies of gene function given incomplete biological knowledge and inconsistencies in how this knowledge is manually curated. Here we show that large networks of gene and protein interactions in Saccharomyces cerevisiae can be used to infer an ontology whose coverage and power are equivalent to those of the manually curated Gene Ontology (GO). The network-extracted ontology (NeXO) contains 4,123 biological terms and 5,766 term-term relations, capturing 58% of known cellular components. We also explore robust NeXO terms and term relations that were initially not cataloged in GO, a number of which have now been added based on our analysis. Using quantitative genetic interaction profiling and chemogenomics, we find further support for many of the uncharacterized terms identified by NeXO, including multisubunit structures related to protein trafficking or mitochondrial function. This work enables a shift from using ontologies to evaluate data to using data to construct and evaluate ontologies.
Complexity is the grand challenge for science and engineering in the 21st century. We suggest that biology is a discipline that is uniquely situated to tackle complexity, through a diverse array of technologies for characterizing molecular structure, interactions and function. A major difficulty in the analysis of complex biological systems is dealing with the low signal-to-noise inherent to nearly all large-scale biological data sets. We discuss powerful bioinformatic concepts for boosting signal-to-noise through external knowledge incorporated in processing units we call Filters and Integrators. These concepts are illustrated in four landmark studies that have provided model implementations of Filters, Integrators, or both.
SUMMARY Recent studies have documented genome-wide binding patterns of transcriptional regulators and their associated epigenetic marks in hematopoietic cell lineages. In order to determine how epigenetic marks are established and maintained during developmental progression, we have generated long-term cultures of hematopoietic progenitors by enforcing the expression of the E-protein antagonist Id2. Hematopoietic progenitors that express Id2 are multipotent and readily differentiate upon withdrawal of Id2 expression into committed B lineage cells, thus indicating a causative role for E2A (Tcf3) in promoting the B cell fate. Genome-wide analyses revealed that a substantial fraction of lymphoid and myeloid enhancers are pre-marked by the poised or active enhancer mark H3K4Me1 in multipotent progenitors. Thus, in hematopoietic progenitors, multilineage priming of enhancer elements precedes commitment to the lymphoid or myeloid cell lineages.
Although it is well-established that the macrophage M1 to M2 transition plays a role in tumor progression, the molecular basis for this process remains incompletely understood. Herein, we demonstrate that the small GTPase, Rac2 controls macrophage M1 to M2 differentiation and the metastatic phenotype in vivo. Using a genetic approach, combined with syngeneic and orthotopic tumor models we demonstrate that Rac2-/- mice display a marked defect in tumor growth, angiogenesis and metastasis. Microarray, RT-PCR and metabolomic analysis on bone marrow derived macrophages isolated from the Rac2-/- mice identify an important role for Rac2 in M2 macrophage differentiation. Furthermore, we define a novel molecular mechanism by which signals transmitted from the extracellular matrix via the α4β1 integrin and MCSF receptor lead to the activation of Rac2 and potentially regulate macrophage M2 differentiation. Collectively, our findings demonstrate a macrophage autonomous process by which the Rac2 GTPase is activated downstream of the α4β1 integrin and the MCSF receptor to control tumor growth, metastasis and macrophage differentiation into the M2 phenotype. Finally, using gene expression and metabolomic data from our Rac2-/- model, and information related to M1-M2 macrophage differentiation curated from the literature we executed a systems biologic analysis of hierarchical protein-protein interaction networks in an effort to develop an iterative interactome map which will predict additional mechanisms by which Rac2 may coordinately control macrophage M1 to M2 differentiation and metastasis.
SummaryAccurately translating genotype to phenotype requires accounting for the functional impact of genetic variation at many biological scales. Here we present a strategy for genotype-phenotype reasoning based on existing knowledge of cellular subsystems. These subsystems and their hierarchical organization are defined by the Gene Ontology or a complementary ontology inferred directly from previously published datasets. Guided by the ontology’s hierarchical structure, we organize genotype data into an “ontotype,” that is, a hierarchy of perturbations representing the effects of genetic variation at multiple cellular scales. The ontotype is then interpreted using logical rules generated by machine learning to predict phenotype. This approach substantially outperforms previous, non-hierarchical methods for translating yeast genotype to cell growth phenotype, and it accurately predicts the growth outcomes of two new screens of 2,503 double gene knockouts impacting DNA repair or nuclear lumen. Ontotypes also generalize to larger knockout combinations, setting the stage for interpreting the complex genetics of disease.
Motivation: While the manually curated Gene Ontology (GO) is widely used, inferring a GO directly from -omics data is a compelling new problem. Recognizing that ontologies are a directed acyclic graph (DAG) of terms and hierarchical relations, algorithms are needed that: analyze a full matrix of gene–gene pairwise similarities from -omics data;infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; andrespect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference.Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast.Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall).Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data.Contact: tideker@ucsd.edu
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.