2018
DOI: 10.1186/s13040-018-0172-x
|View full text |Cite
|
Sign up to set email alerts
|

Soft document clustering using a novel graph covering approach

Abstract: BackgroundIn text mining, document clustering describes the efforts to assign unstructured documents to clusters, which in turn usually refer to topics. Clustering is widely used in science for data retrieval and organisation.ResultsIn this paper we present and discuss a novel graph-theoretical approach for document clustering and its application on a real-world data set. We will show that the well-known graph partition to stable sets or cliques can be generalized to pseudostable sets or pseudocliques. This al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 38 publications
(42 reference statements)
0
2
0
Order By: Relevance
“…Clustering is usually not perceived as a graph problem, although several attempts have been made (e.g. [30]) and here we will show how to generalize it on knowledge graphs. Usually the problem can be formulated in the following way: Given a similarity function for the document or data space D as sim : D × D → R + and an ǫ ∈ R + .…”
Section: Document or Data Clusteringmentioning
confidence: 99%
“…Clustering is usually not perceived as a graph problem, although several attempts have been made (e.g. [30]) and here we will show how to generalize it on knowledge graphs. Usually the problem can be formulated in the following way: Given a similarity function for the document or data space D as sim : D × D → R + and an ǫ ∈ R + .…”
Section: Document or Data Clusteringmentioning
confidence: 99%
“…Text clustering which is also called document clustering [59] is multi disciplineary clustering technique based on information retrieval, natural language processing and machine learning [38]. In document clustering, document collections are grouped together where the same documents in the group have similar topics [39].…”
Section: Text Processingmentioning
confidence: 99%