“…Teams                               Overall     A      B      C
s2 end2end (Ammar et al., 2017)         0.43    0.55   0.44   0.28
TIAL UW                                 0.42    0.56   0.44    –
TTI COIN (Tsujimura et al., 2017)       0.38    0.50   0.39   0.21
PKU ICL (Wang and Li, 2017)             0.37    0.51   0.38   0.19
NTNU-1                                  0.33    0.47   0.34   0.20
WING-NUS (Prasad and Kan, 2017)         0.27    0.46   0.33   0.04
Know-Center (Kern et al., 2017)         0.27    0.39   0.28    –
SZTE-NLP (Berend, 2017)                 0.26    0.35   0.28    –
NTNU (Lee et al., 2017b)                0.23    0.30   0.24   0.08
LABDA (Flores et al., 2017)             0.04    0.08   0.04    –
upper bound                             0.84    0.85   0.85   0.77
random                                  0.00    0.03   0.01   0.00

former is surprising, as keyphrases are overwhelmingly noun phrases; the latter less so, since many keyphrases appear only once in the dataset (see Table 1). GMBUAP further tried empirical rules obtained by observing the training data for Subtask A, and a Naive Bayes classifier trained on the provided training data for Subtask B.…”
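To make the Subtask B setup concrete, the sketch below shows a minimal Naive Bayes classifier over bag-of-words features that assigns a keyphrase to a type, in the spirit of the approach described above. This is an illustrative reconstruction, not the team's actual system: the tiny training set, label names, and helper functions (`train_nb`, `classify`) are invented for this example.

```python
# Hypothetical sketch of a Naive Bayes keyphrase-type classifier.
# All training examples and function names here are illustrative only.
from collections import Counter, defaultdict
import math

def train_nb(examples):
    """examples: list of (tokens, label) pairs.
    Returns log-priors, per-label log-likelihoods, and the vocabulary."""
    label_counts = Counter(label for _, label in examples)
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in examples:
        word_counts[label].update(tokens)
        vocab.update(tokens)
    priors = {l: math.log(c / len(examples)) for l, c in label_counts.items()}
    loglik = {}
    for label in label_counts:
        # Laplace (add-one) smoothing so unseen words get nonzero probability
        total = sum(word_counts[label].values()) + len(vocab)
        loglik[label] = {w: math.log((word_counts[label][w] + 1) / total)
                         for w in vocab}
    return priors, loglik, vocab

def classify(tokens, priors, loglik, vocab):
    """Pick the label maximizing log P(label) + sum of word log-likelihoods."""
    scores = {}
    for label in priors:
        s = priors[label]
        for w in tokens:
            if w in vocab:
                s += loglik[label][w]
        scores[label] = s
    return max(scores, key=scores.get)

# Invented toy training data with the three ScienceIE keyphrase types.
train = [
    (["gradient", "descent"], "Process"),
    (["annealing", "process"], "Process"),
    (["titanium", "alloy"], "Material"),
    (["carbon", "nanotube"], "Material"),
    (["image", "classification"], "Task"),
    (["named", "entity", "recognition"], "Task"),
]
priors, loglik, vocab = train_nb(train)
print(classify(["titanium", "nanotube"], priors, loglik, vocab))  # Material
```

A real system would replace the toy token features with richer ones extracted from the provided training data, but the decision rule is the same.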