Cancer is a multistep process characterized by altered signal transduction, cell growth, and metabolism. To identify such processes in early carcinogenesis we use an information theoretic approach to characterize gene expression quantified as mRNA levels in primary keratinocytes (K) and human papillomavirus 16 (HPV16)-transformed keratinocytes (HF1 cells) from early (E) and late (L) passages and from benzo(a)pyrene-treated (BP) L cells. Our starting point is that biological signaling processes are subjected to the same quantitative laws as inanimate, nonequilibrium chemical systems. Environmental and genomic constraints thereby limit the maximal thermodynamic entropy that the biological system can reach. The procedure uncovers the changes in gene expression patterns in different networks and defines the significance of each altered network in the establishment of a particular phenotype. The development of transformed HF1 cells is shown to be represented by one major transcription pattern that is important at all times. Two minor transcription patterns are also identified, one that contributes at early times and a distinguishably different pattern that contributes at later times. All three transcription patterns defined by our analysis were validated by gene expression values and biochemical means. The major transcription pattern includes reduced transcripts participating in the apoptotic network and enhanced transcripts participating in cell cycle, glycolysis, and oxidative phosphorylation. The two minor patterns identify genes that are mainly involved in lipid or carbohydrate metabolism. microarray analysis | oncogenic transformation | surprisal analysis | maximal entropy | gene transcription patterns G ene expression profiling describes the transcription patterns of thousands of mRNAs at the same time point, allowing insight into or comparison of different cellular conditions. Regulation of gene expression is relevant to many areas of biology and medicine, including the study of different diseases and specifically cancer. To cope with the massive amount of available microarray data [see, for example, the Gene Expression Omnibus (GEO) database], many software packages have been developed (1). These techniques identify a list of "interesting" genes and search for their biological relevance. The techniques used for analysis of microarray data can identify networks that have been changed at each condition. However, it is not possible to delineate the significance of such overall changes to the different transcription patterns that are associated with the different phenotypes. We here propose and apply a physically motivated global method of gene expression analysis that seeks to uncover both the changes in expression patterns of different networks and the significance of each altered network in the establishment of each particular phenotype.Cancer is an evolving, complex system, which goes through several stages before full malignancy. To demonstrate the application of our method we compare gene expression between...
Computers are organized into hardware and software. Using a theoretical approach to identify patterns in gene expression in a variety of species, organs, and cell types, we found that biological systems similarly are comprised of a relatively unchanging hardware-like gene pattern. Orthogonal patterns of software-like transcripts vary greatly, even among tumors of the same type from different individuals. Two distinguishable classes could be identified within the hardware-like component: those transcripts that are highly expressed and stable and an adaptable subset with lower expression that respond to external stimuli. Importantly, we demonstrate that this structure is conserved across organisms. Deletions of transcripts from the highly stable core are predicted to result in cell mortality. The approach provides a conceptual thermodynamic-like framework for the analysis of gene-expression levels and networks and their variations in diseased cells.
The past years have witnessed a rapid increase in the amount of large-scale tumor datasets. The challenge has now become to find a way to obtain useful information from these masses of data that will allow to determine which combination of FDA-approved drugs is best suited to treat the specific tumor. Various statistical analyses are being developed to extract significant signals from cancer datasets. However, tumors are still being assigned to pre-defined categories (breast luminal A, triple negative, etc.), conceptually contradicting the vast heterogeneity that is known to exist among tumors, and likely overlooking unique tumors that must be addressed and treated individually. We present herein an approach based on information theory that, rather than searches for what makes a tumor similar to other tumors, addresses tumors individually and unbiasedly, and impartially decodes the critical patient-specific molecular network reorganization in every tumor. Methods : Using a large dataset obtained from ~3500 tumors of 11 types we decipher the altered protein network structure in each tumor, namely the patient-specific signaling signature. Each signature can harbor several altered protein subnetworks. We suggest that simultaneous targeting of central proteins from every altered subnetwork is essential to efficiently disturb the altered signaling in each tumor. We experimentally validate our ability to dissect sample-specific signaling signatures and to rationally design personalized drug combinations. Results : We unraveled a surprisingly simple order that underlies the extreme apparent complexity of tumor tissues, demonstrating that only 17 altered protein subnetworks characterize ~3500 tumors of 11 types. Each tumor was described by a specific subset of 1-4 subnetworks out of 17, i.e. a tumor-specific altered signaling signature. We show that the majority of tumor-specific signaling signatures are extremely rare, and are shared by only 5 tumors or less, supporting a personalized, comprehensive study of tumors in order to design the optimal combination therapy for every patient. We validate the results by confirming that the processes identified in the 11 original cancer types characterize patients harboring a different cancer type as well. We show experimentally, using different cancer cell lines, that the individualized combination therapies predicted by us achieved higher rates of killing than the clinically prescribed treatments. Conclusions : We present a new strategy to deal with the inter-tumor heterogeneity and to break down the high complexity of cancer systems into simple, easy to crack, patient-specific signaling signatures that guide the rational design of personalized drug therapies.
Every individual cancer develops and grows in its own specific way, giving rise to a recognized need for the development of personalized cancer diagnostics. This suggested that the identification of patient-specific oncogene markers would be an effective diagnostics approach. However, tumors that are classified as similar according to the expression levels of certain oncogenes can eventually demonstrate divergent responses to treatment. This implies that the information gained from the identification of tumor-specific biomarkers is still not sufficient. We present a method to quantitatively transform heterogeneous big cancer data to patient-specific transcription networks. These networks characterize the unbalanced molecular processes that deviate the tissue from the normal state. We study a number of datasets spanning five different cancer types, aiming to capture the extensive interpatient heterogeneity that exists within a specific cancer type as well as between cancers of different origins. We show that a relatively small number of altered molecular processes suffices to accurately characterize over 500 tumors, showing extreme compaction of the data. Every patient is characterized by a small specific subset of unbalanced processes. We validate the result by verifying that the processes identified characterize other cancer patients as well. We show that different patients may display similar oncogene expression levels, albeit carrying biologically distinct tumors that harbor different sets of unbalanced molecular processes. Thus, tumors may be inaccurately classified and addressed as similar. These findings highlight the need to expand the notion of tumor-specific oncogenic biomarkers to patient-specific, comprehensive transcriptional networks for improved patient-tailored diagnostics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.