2002
DOI: 10.1142/s0218213002000861
|View full text |Cite
|
Sign up to set email alerts
|

Comparing Keyword Extraction Techniques for Websom Text Archives

Abstract: The WEBSOM methodology for building very large text archives has a very slow method for extracting meaningful unit labels. This is due to the fact that the method computes for the relative frequencies of all the words of all the documents associated to each unit and then compares these to the relative frequencies of all the words of other units in the map. Since maps may have more than 100,000 units and the archieve may contain up to 7 million documents, the existing WEBSOM method is not practical. A fast alte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2002
2002
2010
2010

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 1 publication
0
1
0
Order By: Relevance
“…A set of noun words, verb words and unit symbols appearing in an article as ‗keywords' are defined here [19], [26]. The system adopts the frequency of the keyword [28,38] as its assigned weight [30].…”
Section: Keyword Weightingmentioning
confidence: 99%
“…A set of noun words, verb words and unit symbols appearing in an article as ‗keywords' are defined here [19], [26]. The system adopts the frequency of the keyword [28,38] as its assigned weight [30].…”
Section: Keyword Weightingmentioning
confidence: 99%