2013
DOI: 10.1371/journal.pone.0076339
|View full text |Cite|
|
Sign up to set email alerts
|

Going the Distance for Protein Function Prediction: A New Distance Metric for Protein Interaction Networks

Abstract: In protein-protein interaction (PPI) networks, functional similarity is often inferred based on the function of directly interacting proteins, or more generally, some notion of interaction network proximity among proteins in a local neighborhood. Prior methods typically measure proximity as the shortest-path distance in the network, but this has only a limited ability to capture fine-grained neighborhood distinctions, because most proteins are close to each other, and there are many ties in proximity. We intro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
194
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 110 publications
(197 citation statements)
references
References 27 publications
3
194
0
Order By: Relevance
“…We showed in [8] for any fixed k , that DSD is a true distance metric, namely that it is symmetric, positive definite, and non-zero whenever u ≠ v , and it obeys the triangle inequality. Thus, one can use DSD to reason about distances in a network in a sound manner.…”
Section: Review Of Dsdmentioning
confidence: 99%
See 1 more Smart Citation
“…We showed in [8] for any fixed k , that DSD is a true distance metric, namely that it is symmetric, positive definite, and non-zero whenever u ≠ v , and it obeys the triangle inequality. Thus, one can use DSD to reason about distances in a network in a sound manner.…”
Section: Review Of Dsdmentioning
confidence: 99%
“…In 2013, Cao et al introduced a new distance measure called Diffusion State Distance, or DSD, designed to be a more fine-grained distance measure for protein-protein interaction networks [8]. In contrast to the typical shortest path metric, which measures distance between pairs of nodes by the number of hops on the shortest path that joins them in the network, DSD was shown to spread out the pairwise distances, making for a more fine-grained notion of graph local neighborhood.…”
Section: Introductionmentioning
confidence: 99%
“…We set a global threshold D based on the average DSD of all pairs, specifically we set D=μc*σ, where μ is the average, and σ is the standard deviation of the global set of DSD values among all pairs of nodes in the graph. As in (Cao et al , 2013), we set c = 1.5.…”
Section: Methodsmentioning
confidence: 99%
“…We first calculate He { k } ( A , B ) as the expected number of times that a random walk starting at node A and proceeding for k steps, will visit node B; then we further define a n -dimensional vector He{k}(vi),viV, where He{k}(vi)=(He{k}(vi,v1),He{k}(vi,v2),...,He{k}(vi,vn)). In what follows, the k -step DSD between two vertices u and v,u,vV is defined as DSD{k}(u,v)=||He{k}(u)He{k}(v)|false|1, where ||He{k}(u)He{k}(v)|false|1 denotes the L 1 norm of the He vectors of u and v . As proved in (Cao et al , 2013), on the simple connected graph whose random walk one-step transition probability matrix is diagonalizable and ergodic as a Markov chain, the limit of DSD when k approaches infinity exists and can be calculated as limkDSD{k}(u,v)=||(bold-italicbbold-italicuTbold-italicbbold-italicvT)(IP+C)1|false|1, where I is the identity ma...…”
Section: Introductionmentioning
confidence: 99%
“…In fact, it is commonplace in the PPI network that regulation relationship, upstreamdownstream relations between interacting proteins when they are involved in signal transduction, transcriptional regulation, cell cycle or metabolism [20]. Moreover, it is reported by recent studies [21][22][23] that GBA is the exception rather than the rule in the PPI network and protein functions are determined by specific and critical interactions. Hence the relationship between interacting proteins may affect their functions and should be considered in the process of predicting protein functions.…”
Section: Introductionmentioning
confidence: 99%