Proceedings of the 2008 SIAM International Conference on Data Mining 2008
DOI: 10.1137/1.9781611972788.7
|View full text |Cite
|
Sign up to set email alerts
|

Maximal Quasi-Bicliques with Balanced Noise Tolerance: Concepts and Co-clustering Applications

Abstract: The rigid all-versus-all adjacency required by a maximal biclique for its two vertex sets is extremely vulnerable to missing data. In the past, several types of quasi-bicliques have been proposed to tackle this problem, however their noise tolerance is usually unbalanced and can be very skewed. In this paper, we improve the noise tolerance of maximal quasi-bicliques by allowing every vertex to tolerate up to the same number, or the same percentage, of missing edges. This idea leads to a more natural interactio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2009
2009
2014
2014

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 33 publications
(21 citation statements)
references
References 29 publications
0
21
0
Order By: Relevance
“…Semiclique mining of the co-occurrence graph, as we have described, is just one way of inducing candidate sets and partial sets. 11 One can envision methods that, similar to ngram extraction, require a few passes over the corpus, and mine for statistically significant sets of co-occurring terms.…”
Section: Discussion On Semiclique Generationmentioning
confidence: 99%
See 1 more Smart Citation
“…Semiclique mining of the co-occurrence graph, as we have described, is just one way of inducing candidate sets and partial sets. 11 One can envision methods that, similar to ngram extraction, require a few passes over the corpus, and mine for statistically significant sets of co-occurring terms.…”
Section: Discussion On Semiclique Generationmentioning
confidence: 99%
“…Here, it is very important to allow for overlapping groupings to support multiple senses and events, unlike much (graph) clustering work. On the other hand, unlike some applications [11], we do not require enumerating all the maximal semicliques. As we mentioned, the co-occurrence graph of terms is best viewed as hierarchical, reflecting term frequencies, not flat.…”
Section: Related Workmentioning
confidence: 99%
“…Thus, these methods cannot be used to find concept graphs since, e.g., the members of a concept graph need not be connected at all. Methods [11] have been proposed for quasi-biclique detection, which explicitly distinguish between members and aspects. The existing methods, however, are not aware of the concept of cue validity and are not able to determine representative vertices for each concept.…”
Section: Related Workmentioning
confidence: 99%
“…In [22], we introduce an alternate version of maximal quasibiclique whose error tolerance is percentage based. As this alternate version does not have anti-monotone property, there is no efficient algorithm to mine it.…”
Section: A Graphmentioning
confidence: 99%