Sander Wubben scite author profile

Currently, there is little agreement as to how Natural Language Generation (NLG) systems should be evaluated, with a particularly high degree of variation in the way that human evaluation is carried out. This paper provides an overview of how human evaluation is currently conducted, and presents a set of best practices, grounded in the literature. With this paper, we hope to contribute to the quality and consistency of human evaluations in NLG.

show abstract

NeuralREG: An end-to-end approach to referring expression generation

Ferreira

Moussallem

Kádár

et al. 2018

View full text Add to dashboard Cite

Traditionally, Referring Expression Generation (REG) models first decide on the form and then on the content of references to discourse entities in text, typically relying on features such as salience and grammatical function. In this paper, we present a new approach (NeuralREG), relying on deep neural networks, which makes decisions about form and content in one go without explicit feature extraction. Using a delexicalized version of the WebNLG corpus, we show that the neural model substantially improves over two strong baselines. Data and models are publicly available 1 .

show abstract

Linguistic realisation as machine translation: Comparing different MT models for AMR-to-text generation

Ferreira¹,

Calixto²,

Wubben³

et al. 2017

View full text Add to dashboard Cite

In this paper, we study AMR-to-text generation, framing it as a translation task and comparing two different MT approaches (Phrasebased and Neural MT). We systematically study the effects of 3 AMR preprocessing steps (Delexicalisation, Compression, and Linearisation) applied before the MT phase. Our results show that preprocessing indeed helps, although the benefits differ for the two MT models. The implementation of the models are publicly available 1 .

show abstract

Towards more variation in text generation: Developing and evaluating variation models for choice of referential form

Ferreira¹,

Krahmer²,

Wubben³

2016

View full text Add to dashboard Cite

In this study, we introduce a nondeterministic method for referring expression generation. We describe two models that account for individual variation in the choice of referential form in automatically generated text: a Naive Bayes model and a Recurrent Neural Network. Both are evaluated using the VaREG corpus. Then we select the best performing model to generate referential forms in texts from the GREC-2.0 corpus and conduct an evaluation experiment in which humans judge the coherence and comprehensibility of the generated texts, comparing them both with the original references and those produced by a random baseline model.

show abstract

Enriching the WebNLG corpus

Ferreira¹,

Moussallem²,

Krahmer³

et al. 2018

View full text Add to dashboard Cite

This paper describes the enrichment of WebNLG corpus (Gardent et al., 2017a,b), with the aim to further extend its usefulness as a resource for evaluating common NLG tasks, including Discourse Ordering, Lexicalization and Referring Expression Generation. We also produce a silverstandard German translation of the corpus to enable the exploitation of NLG approaches to other languages than English. The enriched corpus is publicly available 1 .

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Sander Wubben

Best practices for the human evaluation of automatically generated text

NeuralREG: An end-to-end approach to referring expression generation

Linguistic realisation as machine translation: Comparing different MT models for AMR-to-text generation

Towards more variation in text generation: Developing and evaluating variation models for choice of referential form

Enriching the WebNLG corpus

Contact Info

Product

Resources

About