The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2000
DOI: 10.1016/s1464-1895(00)00100-9
|View full text |Cite
|
Sign up to set email alerts
|

Interactive text retrieval based on document similarities

Abstract: Abstract.In this article we present a prototypical implementation of a software tool for document retrieval which groups/arranges (pre-processed) documents based on a similarity measure. The prototype was developed based on self-organising maps to realise interactive associative search and visual exploration of document databases. This helps a user to navigate through similar documents. The navigation, especially the search for the first appropriate document, is supported by conventional keyword search methods… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
13
0
1

Year Published

2003
2003
2013
2013

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 25 publications
(14 citation statements)
references
References 7 publications
(7 reference statements)
0
13
0
1
Order By: Relevance
“…Empirical results showing that term entropy is good for index term selection can be found in [68]. Thus, we use term entropy as a term weighting method for highlighting appropriate terms in representing a time partition.…”
Section: Temporal Entropymentioning
confidence: 99%
“…Empirical results showing that term entropy is good for index term selection can be found in [68]. Thus, we use term entropy as a term weighting method for highlighting appropriate terms in representing a time partition.…”
Section: Temporal Entropymentioning
confidence: 99%
“…Well known examples are Vivisimo [42] and Grokker [7] Although their underlying clustering logics are not fully disclosed, documents and evidences imply that these systems share many features with such known research systems as the Grouper system [21,2] and Lingo/Carrot Search [12,15]. Recently a new algorithm to classify search results was proposed.…”
Section: Crm and Document Clusteringmentioning
confidence: 99%
“…A simple but very efficient method in this direction is to extract keywords based on their entropy. For instance, in the approach discussed in [18], for each word k in the vocabulary the entropy as defined by [22] was computed:…”
Section: Methodsmentioning
confidence: 99%
“…That is, of two words occurring equally often the one with the higher entropy is preferred. Empirically this procedure has proven to yield a set of relevant words that are suited to serve as index terms [18]. In order to obtain a fixed number of terms that cover the document collection well, we applied a greedy strategy: from an arbitrary document in the collection select the term with the highest relative entropy as an index term.…”
Section: Methodsmentioning
confidence: 99%