A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks

Wang, Jie; Zheng, Wei; Liang, Jiye

doi:10.3390/molecules22122179

Cited by 11 publications

(5 citation statements)

References 44 publications

(69 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To assess the effectiveness of our proposed method, we compared it against several existing algorithms, namely HGCA (Wang et al, 2019), IPCA (Li et al, 2008), DCU (Zhao et al, 2014), and SEGC (Wang et al, 2017). In this comparative analysis, we specifically focused on methods that strive to encompass nearly 100 percent of the proteins when constructing complexes, ensuring that each node is included in at least one complex.…”

Section: Comparison With Other Methodsmentioning

confidence: 99%

“…This recursive process ensures a comprehensive expansion of the cluster, guided by the local structural properties of the protein interaction network. SEGC, introduced in (Wang et al, 2017), presents a unique approach to seed selection by employing a roulette wheel strategy, thereby enhancing the diversity of clusters. Through the evaluation of both the cluster density and the connection of a node u to cluster C, the algorithm computes the closeness measure NC(u, C), indicating the proximity of the node to the cluster.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Enhanced Protein Complex Detection Using Square Clustering Coefficient

Mirzaee,

charkari,

Roayaei

2024

Preprint

View full text Add to dashboard Cite

Identifying protein complexes from protein-protein interaction networks is one of the crucial tasks in computational biology. Traditional methods, along with their shortcomings in fully understanding protein complex composition, also have inherent limitations and are expensive to implement. In this paper, we introduce a novel method that not only acknowledges but actively tackles these challenges. Our approach, centered around a core-attachment framework, employs a blend of topological metrics, such as square clustering coefficients, in conjunction with traditional clustering coefficients. After establishing the core, we incorporate attachment proteins based on specific conditions employing a based depth-first approach to form a protein complex. By harnessing multiple metrics, our goal is to elevate the accuracy of protein complex identification beyond what single-metric approaches can achieve. To validate the effectiveness of our approach, we conducted extensive experiments using multiple datasets, including Gavin06, Krogan core, Krogan extend, and DIP datasets, and assessed metrics such as precision, recall, F-measure, and coverage. Our results not only demonstrate the superiority of our method over traditional approaches but also align with findings from related studies. Overall, our study contributes to the ongoing efforts in computational biology by presenting a comprehensive approach to protein complex identification that addresses the shortcomings of previous methods. Through a combination of innovative techniques and insights from recent research, we aim to push the boundaries of accuracy and comprehensiveness in protein complex detection.

show abstract

Section: Comparison With Other Methodsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Enhanced Protein Complex Detection Using Square Clustering Coefficient

Mirzaee,

charkari,

Roayaei

2024

Preprint

View full text Add to dashboard Cite

show abstract

“…To account for the minimum diameter and average node distance characteristics of protein complexes, the improved DPClus algorithm (IPCA) [17] enhances DPClus through the integration of sub-graph diameters and interaction probabilities, which provide insights into the density of the network. Other methods in this category include SEGC [18], Core [19], etc.…”

Section: Introductionmentioning

confidence: 99%

A Network Clustering Algorithm for Protein Complex Detection Fused with Power-Law Distribution Characteristic

et al. 2023

Self Cite

View full text Add to dashboard Cite

Network clustering for mining protein complexes from protein–protein interaction (PPI) networks has emerged as a prominent research area in data mining and bioinformatics. Accurately identifying complexes plays a crucial role in comprehending cellular organization and functionality. Network characteristics are often useful in enhancing the performance of protein complex detection methods. Many protein complex detection algorithms have been proposed, primarily focusing on local micro-topological structure metrics while overlooking the potential power-law distribution characteristic of community sizes at the macro global level. The effective use of this distribution characteristic information may be beneficial for mining protein complexes. This paper proposes a network clustering algorithm for protein complex detection fused with power-law distribution characteristic. The clustering algorithm constructs a cluster generation model based on scale-free power-law distribution to generate a cluster with a dense center and relatively sparse periphery. Following the cluster generation model, a candidate cluster is obtained. From a global perspective, the number distribution of clusters of varying sizes is taken into account. If the candidate cluster aligns with the constraints defined by the power-law distribution function of community sizes, it is designated as the final cluster; otherwise, it is discarded. To assess the prediction performance of the proposed algorithm, the gold standard complex sets CYC2008 and MIPS are employed as benchmarks. The algorithm is compared to DPClus, IPCA, SEGC, Core, SR-MCL, and ELF-DPC in terms of F-measure and Accuracy on several widely used protein–protein interaction networks. The experimental results show that the algorithm can effectively detect protein complexes and is superior to other comparative algorithms. This study further enriches the connection between analyzing complex network topology features and mining network function modules, thereby significantly contributing to the improvement of protein complex detection performance.

show abstract

“…There have been various different clustering methods in the literature. In general, they can be classified into 6 categories, that is density-based (c.f., DP [8], DP-HD [9], DBSCAN [10], NQ-DBSCAN [11], CSSub [12] and GDPC [13]), grid-based (c.f., CLIQUE [14], Gridwave [15] and WaveCluster [16]), model-based (c.f., Gaussian parsimonious [17], Gaussian mixture models [18] and Latent tree models [19]), partition-ing (c.f., K-means [20,21,22], K-partitioning [23] and TLBO [24]), graph-based (SEGC [25], ProClust [26] and MCSSGC [27]), and hierarchical (c.f., BIRCH [28] and CHAMELEON [29]) approaches.…”

Section: Introductionmentioning

confidence: 99%

A self-adaptive and robust fission clustering algorithm via heat diffusion and maximal turning angle

Han¹,

Lu²,

Xu³

2021

Preprint

View full text Add to dashboard Cite

Cluster analysis, which focuses on the grouping and categorization of similar elements, is widely used in various fields of research. A novel and fast clustering algorithm, fission clustering algorithm, is proposed in recent year. In this article, we propose a robust fission clustering (RFC) algorithm and a self-adaptive noise identification method. The RFC and the self-adaptive noise identification method are combine to propose a self-adaptive robust fission clustering (SARFC) algorithm. Several frequently-used datasets were applied to test the performance of the proposed clustering approach and to compare the results with those of other algorithms. The comprehensive comparisons indicate that the proposed method has advantages over other common methods.

show abstract

A Seed Expansion Graph Clustering Method for Protein Complexes Detection in Protein Interaction Networks

Cited by 11 publications

References 44 publications

Enhanced Protein Complex Detection Using Square Clustering Coefficient

Enhanced Protein Complex Detection Using Square Clustering Coefficient

A Network Clustering Algorithm for Protein Complex Detection Fused with Power-Law Distribution Characteristic

A self-adaptive and robust fission clustering algorithm via heat diffusion and maximal turning angle

Contact Info

Product

Resources

About