Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the ACL - ACL '06 2006
DOI: 10.3115/1220175.1220243
|View full text |Cite
|
Sign up to set email alerts
|

A study on automatically extracted keywords in text categorization

Abstract: This paper presents a study on if and how automatically extracted keywords can be used to improve text categorization. In summary we show that a higher performance -as measured by micro-averaged F-measure on a standard text categorization collection -is achieved when the full-text representation is combined with the automatically extracted keywords. The combination is obtained by giving higher weights to words in the full-texts that are also extracted as keywords. We also present results for experiments in whi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
74
0
1

Year Published

2010
2010
2019
2019

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 129 publications
(75 citation statements)
references
References 13 publications
(12 reference statements)
0
74
0
1
Order By: Relevance
“…In other words, its goal is to extract a set of phrases that are related to the main topics discussed in a given document (Tomokiyo and Hurst, 2003;Liu et al, 2009b;Ding et al, 2011;Zhao et al, 2011). Document keyphrases have enabled fast and accurate searching for a given document from a large text collection, and have exhibited their potential in improving many natural language processing (NLP) and information retrieval (IR) tasks, such as text summarization (Zhang et al, 2004), text categorization (Hulth and Megyesi, 2006), opinion mining (Berend, 2011), and document indexing .…”
Section: Introductionmentioning
confidence: 99%
“…In other words, its goal is to extract a set of phrases that are related to the main topics discussed in a given document (Tomokiyo and Hurst, 2003;Liu et al, 2009b;Ding et al, 2011;Zhao et al, 2011). Document keyphrases have enabled fast and accurate searching for a given document from a large text collection, and have exhibited their potential in improving many natural language processing (NLP) and information retrieval (IR) tasks, such as text summarization (Zhang et al, 2004), text categorization (Hulth and Megyesi, 2006), opinion mining (Berend, 2011), and document indexing .…”
Section: Introductionmentioning
confidence: 99%
“…To solve this labeling issue, some studies, based on text summary [17] and key phrase extractionapproaches [24]identify text portions or key phrases according to their major theme [8]. Other methods focus on the identification of text portions related to the document title [29].…”
Section: Related Workmentioning
confidence: 99%
“…We then use these new terms directly, or broken down into single terms (in case of multiword terms). This last feature is motivated by [10], who showed improved document classification results after breaking down multiwords for partial matches. In summary, we use the following four types of semantic features:…”
Section: Semantic Informationmentioning
confidence: 99%