2012
DOI: 10.1007/s10115-012-0480-2
|View full text |Cite
|
Sign up to set email alerts
|

An automatic keyphrase extraction system for scientific documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 45 publications
(23 citation statements)
references
References 26 publications
0
23
0
Order By: Relevance
“…We use four benchmark datasets shown in Table 1 for empirical observations and comparisons. These datasets have been used extensively to evaluate keyword extraction algorithms [4,19,30,33,35,43]. Table 1 presents general properties of the four datasets, including number of documents in corpus, average document length, average number of gold-standard keywords along with standard deviation, and average percentage of candidate keywords.…”
Section: Methodsmentioning
confidence: 99%
“…We use four benchmark datasets shown in Table 1 for empirical observations and comparisons. These datasets have been used extensively to evaluate keyword extraction algorithms [4,19,30,33,35,43]. Table 1 presents general properties of the four datasets, including number of documents in corpus, average document length, average number of gold-standard keywords along with standard deviation, and average percentage of candidate keywords.…”
Section: Methodsmentioning
confidence: 99%
“…According to the results reported in [5], the use of the feature indicating phrase position at the beginning of a document works for academic papers and does not lead to better performance in case of book chapters and scientific webpages that do not have an abstract. In our research we show that precisely at the beginning of a scientific publication, in an abstract section, which is not included in a literary text, the major part of keyphrases is gathered.…”
Section: Introduction and Related Workmentioning
confidence: 85%
“…This fact should also be considered, as it allows to generate less candidate phrases during processing. In [5] it was mentioned that too many candidates negatively influence ranking and one of the most important tasks consists in elaboration of algorithms for the construction of small candidate sets.…”
Section: Introduction and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…to be part of a noun phrase, as was the case in the study by Barker and Cornacchia [5]. You et al [124] used the so-called core word expansion algorithm, which first finds a set of core words and the final set of candidate phrases are generated from these seed phrases. They claimed that their method might reduce the candidate set by about 75%.…”
Section: Generation Of Keyphrase Candidatesmentioning
confidence: 99%