Supplementary data are available at Bioinformatics online.
In recent years, researchers have tried to inject dynamic information into static protein interaction networks (PINs). The paper first proposes a three-sigma method to identify active time points of each protein in a cellular cycle, where three-sigma principle is used to compute an active threshold for each gene according to the characteristics of its expression curve. Then a dynamic protein interaction network (DPIN) is constructed, which includes the dynamic changes of protein interactions. To validate the efficiency of DPIN, MCL, CPM, and core attachment algorithms are applied on two different DPINs, the static PIN and the time course PIN (TC-PIN) to detect protein complexes. The performance of each algorithm on DPINs outperforms those on other networks in terms of matching with known complexes, sensitivity, specificity, f-measure, and accuracy. Furthermore, the statistics of three-sigma principle show that 23-45% proteins are active at a time point and most proteins are active in about half of cellular cycle. In addition, we find 94% essential proteins are in the group of proteins that are active at equal or great than 12 timepoints of GSE4987, which indicates the potential existence of feedback mechanisms that can stabilize the expression level of essential proteins and might provide a new insight for predicting essential proteins from dynamic protein networks.
Protein-protein interactions (PPIs) participate in all important biological processes in living organisms, such as catalyzing metabolic reactions, DNA replication, DNA transcription, responding to stimuli and transporting molecules from one location to another. To reveal the function mechanisms in cells, it is important to identify PPIs that take place in the living organism. A large number of PPIs have been discovered by high-throughput experiments and computational methods. However, false-positive PPIs have been introduced too. Therefore, to obtain reliable PPIs, many computational methods have been proposed. Generally, these methods can be classified into two categories. One category includes the methods that are designed to determine new reliable PPIs. The other one is designed to assess the reliability of existing PPIs and filter out the unreliable ones. In this article, we review the two kinds of methods for detecting reliable PPIs, and then focus on evaluating the performance of some of these typical methods. Later on, we also enumerate several PPI network-based applications with taking a reliability assessment of the PPI data into consideration. Finally, we will discuss the challenges for obtaining reliable PPIs and future directions of the construction of reliable PPI networks. Our research will provide readers some guidance for choosing appropriate methods and features for obtaining reliable PPIs.
With more dynamic information available, researchers' attention has recently shifted from static properties to dynamic properties of protein-protein interaction networks. To compensate the limited ability of technologies of detecting dynamic protein-protein interactions, dynamic protein interaction networks (DPINs) can be constructed by involving proteomic, genomic, and transcriptome analyses. Two groups of DPIN construction methods are classified based on the different focuses on dynamic information extracted from gene expression data. The dynamics of one kind of DPINs is reflected by the changes in protein presence varying with time, while that of the other kind of DPINs is reflected by the differences of coexpression under different conditions. In this review, the applications on DPINs will be discussed, including protein complexes/functional modules and network organization analysis, biomarkers detection in the progression or prognosis of the disease, and network medicine. We also point out the challenges in DPINs construction and future directions in the research of DPINs at the end of this review.
The high risk of developing colorectal carcinoma (CRC), from ulcerative colitis (UC), is well known. Macrophages are widely distributed immune cells that have an indispensable role in UC, as well as in CRC. However, little is currently known about the dynamic changes that occur in macrophage and M1/M2 macrophage subpopulations, during UC-associated carcinogenesis. The aim of the present study was to investigate the alteration of colorectal macrophages and M1/M2 macrophage subpopulations during UC-associated carcinogenesis. Both expression level alterations and functional changes were determined during UC-associated carcinogenesis in an azoxymethane/dextran sodium sulfate-induced chemically colitis-associated carcinoma mouse model of Crj:CD-1 (ICR) mice. Notable evidence from immunohistochemistry, flow cytometry, cytokine detection, and gene expression analyses demonstrated that M2 macrophages have a critical role in CRC initiation, promotion, and metastasis. M2 macrophages are associated with unbalanced pro-inflammatory and anti-inflammatory axes and aberrant enhancement of migration/invasion-associated factors. Functional changes, similar to M2 polarized macrophages, were shown to occur in the M1 macrophages, without phenotypical changes, during the development of carcinoma and metastasis. The results of the present study suggest that M2 macrophages have a pro-tumor role during UC-associated carcinogenesis. Furthermore, similar functional changes occurred in the M1 macrophages, without polarization alterations, during carcinogenesis and metastasis.
Essential proteins are vitally important for cellular survival and development, and identifying essential proteins is very meaningful research work in the post-genome era. Rapid increase of available protein-protein interaction (PPI) data has made it possible to detect protein essentiality at the network level. A series of centrality measures have been proposed to discover essential proteins based on the PPI networks. However, the PPI data obtained from large scale, high-throughput experiments generally contain false positives. It is insufficient to use original PPI data to identify essential proteins. How to improve the accuracy, has become the focus of identifying essential proteins. In this paper, we proposed a framework for identifying essential proteins from active PPI networks constructed with dynamic gene expression. Firstly, we process the dynamic gene expression profiles by using time-dependent model and time-independent model. Secondly, we construct an active PPI network based on co-expressed genes. Lastly, we apply six classical centrality measures in the active PPI network. For the purpose of comparison, other prediction methods are also performed to identify essential proteins based on the active PPI network. The experimental results on yeast network show that identifying essential proteins based on the active PPI network can improve the performance of centrality measures considerably in terms of the number of identified essential proteins and identification accuracy. At the same time, the results also indicate that most of essential proteins are active.
Background China’s “13th 5-Year Plan” (2016–2020) for the prevention and control of sudden acute infectious diseases emphasizes that epidemic monitoring and epidemic focus surveys in key areas are crucial for strengthening national epidemic prevention and building control capacity. Establishing an epidemic hot spot areas and prediction model is an effective means of accurate epidemic monitoring and surveying. Objective: This study predicted hemorrhagic fever with renal syndrome (HFRS) epidemic hot spot areas, based on multi-source environmental variable factors. We calculated the contribution weight of each environmental factor to the morbidity risk, obtained the spatial probability distribution of HFRS risk areas within the study region, and detected and extracted epidemic hot spots, to guide accurate epidemic monitoring as well as prevention and control. Methods: We collected spatial HFRS data, as well as data on various types of natural and human social activity environments in Hunan Province from 2010 to 2014. Using the information quantity method and logistic regression modeling, we constructed a risk-area-prediction model reflecting the epidemic intensity and spatial distribution of HFRS. Results: The areas under the receiver operating characteristic curve of training samples and test samples were 0.840 and 0.816. From 2015 to 2019, HRFS case site verification showed that more than 82% of the cases occurred in high-risk areas. Discussion This research method could accurately predict HFRS hot spot areas and provided an evaluation model for Hunan Province. Therefore, this method could accurately detect HFRS epidemic high-risk areas, and effectively guide epidemic monitoring and surveyance.
Essential proteins are indispensable for living organisms to maintain life activities and play important roles in the studies of pathology, synthetic biology, and drug design. Therefore, besides experiment methods, many computational methods are proposed to identify essential proteins. Based on the centrality-lethality rule, various centrality methods are employed to predict essential proteins in a Protein-protein Interaction Network (PIN). However, neglecting the temporal and spatial features of protein-protein interactions, the centrality scores calculated by centrality methods are not effective enough for measuring the essentiality of proteins in a PIN. Moreover, many methods, which overfit with the features of essential proteins for one species, may perform poor for other species. In this paper, we demonstrate that the centrality-lethality rule also exists in Protein Subcellular Localization Interaction Networks (PSLINs). To do this, a method based on Localization Specificity for Essential protein Detection (LSED), was proposed, which can be combined with any centrality method for calculating the improved centrality scores by taking into consideration PSLINs in which proteins play their roles. In this study, LSED was combined with eight centrality methods separately to calculate Localization-specific Centrality Scores (LCSs) for proteins based on the PSLINs of four species (Saccharomyces cerevisiae, Homo sapiens, Mus musculus and Drosophila melanogaster). Compared to the proteins with high centrality scores measured from the global PINs, more proteins with high LCSs measured from PSLINs are essential. It indicates that proteins with high LCSs measured from PSLINs are more likely to be essential and the performance of centrality methods can be improved by LSED. Furthermore, LSED provides a wide applicable prediction model to identify essential proteins for different species.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.