2004
DOI: 10.1081/bip-200025659
|View full text |Cite
|
Sign up to set email alerts
|

A Knowledge-Based Clustering Algorithm Driven by Gene Ontology

Abstract: We have developed an algorithm for inferring the degree of similarity between genes by using the graph-based structure of Gene Ontology (GO). We applied this knowledge-based similarity metric to a clique-finding algorithm for detecting sets of related genes with biological classifications. We also combined it with an expression-based distance metric to produce a co-cluster analysis, which accentuates genes with both similar expression profiles and similar biological characteristics and identifies gene clusters… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
59
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 96 publications
(59 citation statements)
references
References 19 publications
0
59
0
Order By: Relevance
“…1 , although oxygen binding and ion binding are both at a depth of 2, the former is a more specifi c concept and is actually a leaf node. More recent approaches attempt at mitigating some of these issues using for instance the depth of the lowest common ancestor (LCA) [ 11 ], distance to nearest leaf node [ 12 ], and depth of distinct GO subgraphs [ 1 ]. Related approaches, also based on the structure of the ontology, combine distance metrics with node structural properties, such as number of subclasses and distance to the lowest common ancestor between the terms [ 13 ].…”
Section: Ss Measuresmentioning
confidence: 99%
“…1 , although oxygen binding and ion binding are both at a depth of 2, the former is a more specifi c concept and is actually a leaf node. More recent approaches attempt at mitigating some of these issues using for instance the depth of the lowest common ancestor (LCA) [ 11 ], distance to nearest leaf node [ 12 ], and depth of distinct GO subgraphs [ 1 ]. Related approaches, also based on the structure of the ontology, combine distance metrics with node structural properties, such as number of subclasses and distance to the lowest common ancestor between the terms [ 13 ].…”
Section: Ss Measuresmentioning
confidence: 99%
“…Edge-based methods are intuitive, among which (Pekar and Staab, 2002) and (Cheng et al, 2004) are two representative ones. Suppose t 1 and t 2 are two terms, and t is their lowest common ancestor.…”
Section: Measuring Semantic Similarity On Gomentioning
confidence: 99%
“…The state-of-the-art methods for specifying semantic similarity over the GO terms can be divided into three groups: edge-based, node-based, and a hybrid of the above two. For the edge-based approaches, they mainly consider the lengths of the paths connecting the terms (Cheng et al, 2004;Pekar and Staab, 2002) as the distance between the terms. For the node-based methods, they rely on the properties of the terms derived from information theory (Jiang and Conrath, 1997;Lin, 1998;Resnik, 1999;Schlicker et al, 2006).…”
Section: Semantic Similarity On Gomentioning
confidence: 99%
See 1 more Smart Citation
“…These measures also have the "shallow annotation" drawback [6][7][8]: two terms with a certain distance near the root have equal semantic similarity with two terms with the same distance but far from the root. Other edge-based measures [2,9] have attempted to overcome this limitation by assigning different weights to the edges at different graph levels using network density, but they still ignored one fact: GO terms at the same level do not always share same specificity because two terms in the same level can have different gene properties.…”
Section: Introductionmentioning
confidence: 99%