2010 Workshops on Database and Expert Systems Applications 2010
DOI: 10.1109/dexa.2010.31
|View full text |Cite
|
Sign up to set email alerts
|

Thesaurus Based Term Ranking for Keyword Extraction

Abstract: In many cases keywords from a restricted set of possible keywords have to be assigned to texts. A common way to find the best keywords is to rank terms occurring in the text according to their tf.idf value. This requires a corpus of texts from which document frequencies can be derived. In this paper we show that we can obtain results of the same quality without the usage of a background corpus, using relations between terms provided in a thesaurus.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(6 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…Some methods utilise external knowledge to provide useful contextual information for extracting more semantically sensible keywords. For instance, Li and Wang (2014) proposed a TextRank-based AKE method that benefits from domain knowledge by using author-assigned keywords of scientific publications, and Gazendam et al (2010) proposed to use semantic relations between thesaurus terms for ranking candidate keywords without a reference corpus (Gazendam et al, 2010). Thesaurus relations have also been combined with machine learning techniques to improve performance of AKE methods (Hulth et al, 2001;Medelyan and Witten, 2006).…”
Section: Pos-tagging and Semantics In Akementioning
confidence: 99%
“…Some methods utilise external knowledge to provide useful contextual information for extracting more semantically sensible keywords. For instance, Li and Wang (2014) proposed a TextRank-based AKE method that benefits from domain knowledge by using author-assigned keywords of scientific publications, and Gazendam et al (2010) proposed to use semantic relations between thesaurus terms for ranking candidate keywords without a reference corpus (Gazendam et al, 2010). Thesaurus relations have also been combined with machine learning techniques to improve performance of AKE methods (Hulth et al, 2001;Medelyan and Witten, 2006).…”
Section: Pos-tagging and Semantics In Akementioning
confidence: 99%
“…External knowledge based approaches use external knowledge sources such as terminological databases: GRISP, MeSH, etc. [5, 1821], linguistic ressources (WordNet, etc. [22, 23]), article databanks (Wikipedia, etc.…”
Section: Related Workmentioning
confidence: 99%
“…However, these methods do not consider various aspects of words, so the chosen keywords may cover the content of documents in limited aspects. There are also some other types of research that use external data such as a thesaurus or ontology [2022]. A thesaurus and ontology can be a good source as a secondary knowledge for extracting keywords, but they may carry a high cost of construction.…”
Section: Related Workmentioning
confidence: 99%