Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019
DOI: 10.18653/v1/d19-1620

Countering the Effects of Lead Bias in News Summarization via Multi-Stage Training and Auxiliary Losses

Abstract: Sentence position is a strong feature for news summarization, since the lead often (but not always) summarizes the key points of the article. In this paper, we show that recent neural systems excessively exploit this trend, which although powerful for many inputs, is also detrimental when summarizing documents where important content should be extracted from later parts of the article. We propose two techniques to make systems sensitive to the importance of content in different parts of the article. The first …
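The abstract is truncated before it spells out the two techniques. As a purely illustrative sketch (not the authors' formulation), the snippet below shows one way an auxiliary loss can discourage lead bias in an extractive scorer: penalize selection probability that concentrates on the first few sentences beyond what a uniform prior would allocate. All names here (`lead_bias_penalty`, `sentence_logits`, `lead_k`) are hypothetical.

```python
# Illustrative auxiliary loss, assuming an extractive model that emits one
# selection logit per sentence. A sketch only, not the cited paper's loss.
import torch
import torch.nn.functional as F

def lead_bias_penalty(sentence_logits: torch.Tensor, lead_k: int = 3) -> torch.Tensor:
    """Penalize selection probability concentrated on the first `lead_k` sentences.

    sentence_logits: (batch, num_sentences) raw scores from the extractor.
    """
    probs = torch.softmax(sentence_logits, dim=-1)
    lead_mass = probs[:, :lead_k].sum(dim=-1)         # mass placed on the lead
    uniform_mass = lead_k / sentence_logits.size(-1)  # mass under a uniform prior
    # Only penalize mass above the uniform level, so articles whose summary
    # genuinely lives in the lead are not punished outright.
    return F.relu(lead_mass - uniform_mass).mean()

# Combined objective (hypothetical weighting):
#   loss = extraction_loss + 0.1 * lead_bias_penalty(sentence_logits)
```

The penalty only activates when the model over-selects the lead, which matches the abstract's observation that the lead often, but not always, carries the key points.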

Cited by 31 publications (32 citation statements). References: 21 publications.
Citation types: 0 supporting, 31 mentioning, 0 contrasting.
“…Furthermore, abstracts are almost always available rather than behind paywalls like full-text articles. For news summarisation, we used a state-of-the-art extractive model (Grenander et al., 2019) to extract sentences forming a summary of the original text. This model provides a summary de-biasing mechanism preventing it from focusing on specific parts of the full article, preserving the summary's informational authenticity as much as possible.…”
Section: Article Summarisation (mentioning; confidence: 99%)
“…The high level of abstractiveness makes our dataset challenging, since models cannot simply copy sentences from the reference articles (Grenander et al., 2019). The extractive oracle performance indicates the level of “extractiveness” of each dataset.…”
Section: Dataset Creation (mentioning; confidence: 99%)
“…High positional and extractive biases can undesirably enable models to achieve high summarization scores by copying sentences from certain (fixed) positions, e.g. lead sentences in news summarization (Grenander et al., 2019; Narayan et al., 2018a). Empirical results show that our dataset is challenging and requires models to have a high level of text abstractiveness.…”
Section: Introduction (mentioning; confidence: 99%)
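The positional bias this statement describes is easy to observe directly: on lead-heavy news data, a “summary” made of the first few sentences already scores well. A minimal sketch using Google's rouge-score package; the toy article, reference, and the `lead_k_summary` helper are assumptions for illustration, not part of the cited work.

```python
# Measure how well a Lead-3 baseline scores against a reference summary.
# Uses the rouge-score package (pip install rouge-score).
from rouge_score import rouge_scorer

def lead_k_summary(sentences, k=3):
    """Hypothetical helper: the first k sentences, joined as a 'summary'."""
    return " ".join(sentences[:k])

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)

# Toy example only; real evaluations average over a whole test set.
article = [
    "The city council approved the new budget on Monday.",
    "The plan increases transit funding by ten percent.",
    "Officials said the vote followed months of debate.",
    "Buried later: the library budget was quietly halved.",
]
reference = "The council approved a budget raising transit funding ten percent."

scores = scorer.score(reference, lead_k_summary(article))
print({name: round(s.fmeasure, 3) for name, s in scores.items()})
```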
“…It may be tempting to apply neural abstractive summarization to meetings given its remarkable recent success on summarization benchmarks, e.g., CNN/DM (See et al., 2017; Chen and Bansal, 2018; Gehrmann et al., 2018; Laban et al., 2020). However, the challenge lies not only in handling the hallucinations seen in abstractive models (Kryscinski et al., 2019; Lebanoff et al., 2019; Maynez et al., 2020) but also in the models' strong positional bias, which arises as a consequence of fine-tuning on news articles (Kedzie et al., 2018; Grenander et al., 2019). Neural summarizers also assume a maximum sequence length; e.g., Perez-Beltrachini et al. (2019) use the first 800 tokens of the document as input.…”
Section: Introduction (mentioning; confidence: 99%)
“…The pretraining data contain 160GB of news, books, stories, and web text. It remains unclear whether the model can effectively identify salient content in spoken text and how well it can reduce lead bias, which is not as frequent in spoken text as in news writing (Grenander et al., 2019). Secondly, a transcript can far exceed the maximum input length of the model, which is restricted by the GPU memory size.…”
Section: Introduction (mentioning; confidence: 99%)
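The length problem this last statement raises, transcripts far exceeding a model's maximum input, is commonly worked around by summarizing overlapping chunks and merging the results. A minimal sketch under that assumption; the window sizes, tokenization, and `chunk_tokens` helper are illustrative, not the cited paper's method.

```python
# Split an over-long token sequence into overlapping windows that each fit a
# model's input limit. Window and overlap sizes here are illustrative only.
from typing import List

def chunk_tokens(tokens: List[str], max_tokens: int = 800,
                 overlap: int = 100) -> List[List[str]]:
    """Overlapping windows of at most `max_tokens`, stepping by max_tokens - overlap."""
    step = max_tokens - overlap
    return [tokens[i:i + max_tokens]
            for i in range(0, max(len(tokens) - overlap, 1), step)]

transcript = "so um the main decision today concerns the budget for phase two".split()
for i, chunk in enumerate(chunk_tokens(transcript, max_tokens=6, overlap=2)):
    print(i, " ".join(chunk))
# Each chunk would be summarized independently and the partial summaries merged.
```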