Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1618
|View full text |Cite
|
Sign up to set email alerts
|

SUM-QE: a BERT-based Summary Quality Estimation Model

Abstract: We propose SUM-QE, a novel Quality Estimation model for summarization based on BERT. The model addresses linguistic quality aspects that are only indirectly captured by content-based approaches to summary evaluation, without involving comparison with human references. SUM-QE achieves very high correlations with human ratings, outperforming simpler models addressing these linguistic aspects. Predictions of the SUM-QE model can be used for system development, and to inform users of the quality of automatically p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
36
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 36 publications
(38 citation statements)
references
References 19 publications
1
36
0
Order By: Relevance
“…• SIMetrix (Louis and Nenkova, 2009) • SumQE (Xenouleas et al, 2019) Among these metrics, 6 have original implementations in Java, 6 in Python, 1 in Perl, and 1 with no known official implementation (Pyramid Score).…”
Section: The Metric Interfacementioning
confidence: 99%
“…• SIMetrix (Louis and Nenkova, 2009) • SumQE (Xenouleas et al, 2019) Among these metrics, 6 have original implementations in Java, 6 in Python, 1 in Perl, and 1 with no known official implementation (Pyramid Score).…”
Section: The Metric Interfacementioning
confidence: 99%
“…Some work discussed how to evaluate the quality of generated text in the reference-free setting (Louis and Nenkova, 2013;Peyrard et al, 2017;Peyrard and Gurevych, 2018;Shimanaka et al, 2018;Xenouleas et al, 2019;Sun and Nenkova, 2019;Böhm et al, 2019;Chen et al, 2018;Gao et al, 2020). Louis and Nenkova (2013), Peyrard et al (2017) and Peyrard and Gurevych (2018) leveraged regression models to fit human judgement.…”
Section: Reference-free Metricsmentioning
confidence: 99%
“…RUSE (Shimanaka et al, 2018) use sentence embeddings generated by three different models and aggregate them using a MLP regressor. Xenouleas et al (2019) proposed a method that also uses a regression model to predict the scores, while the predictions are based on hidden representations generated using BERT (Devlin et al, 2019) as the encoder. However, these methods require ratings assigned by human annotators as training data which are also costly to obtain.…”
Section: Reference-free Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…One possible route to a better automatic method for summary quality estimation is to train a model on document summaries annotated with human quality scores Nenkova, 2009, 2013;Xenouleas et al, 2019). Such a model could be used to evaluate summaries without further human involvement.…”
Section: Introductionmentioning
confidence: 99%