Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17)
DOI: 10.1145/3132847.3133000

A Comparison of Nuggets and Clusters for Evaluating Timeline Summaries

Abstract: There is growing interest in systems that generate timeline summaries by filtering high-volume streams of documents to retain only those that are relevant to a particular event or topic. Continued advances in algorithms and techniques for this task depend on standardized and reproducible evaluation methodologies for comparing systems. However, timeline summary evaluation is still in its infancy, with competing methodologies currently being explored in international evaluation forums such as TREC. One area …

Cited by 4 publications (11 citation statements) · References 23 publications
“…Evaluation of summarization algorithms is traditionally done using ROUGE scores (based on unigram/bigram overlap with gold standard summaries), but these measures are not sufficient for timeline summarization methods. Nugget-based or cluster-based evaluation methods have recently been shown to be more effective, but they require a lot of annotation effort (Baruah et al. 2017).…”
Section: RC3: Summarization of Social Media Content Streams
Mentioning, confidence: 99%
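The quoted statement above characterizes ROUGE as unigram/bigram overlap against gold-standard summaries. The following is a minimal Python sketch of that idea as ROUGE-N recall; the function names and the whitespace tokenization are illustrative assumptions, not the official ROUGE toolkit or any system from the cited papers.

    # Minimal sketch of ROUGE-N recall as n-gram overlap with a gold-standard summary.
    # Assumptions: whitespace tokenization, lowercasing; names are illustrative only.
    from collections import Counter

    def ngrams(tokens, n):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    def rouge_n_recall(system_summary, gold_summary, n=2):
        sys_counts = ngrams(system_summary.lower().split(), n)
        gold_counts = ngrams(gold_summary.lower().split(), n)
        if not gold_counts:
            return 0.0
        overlap = sum(min(count, sys_counts[gram]) for gram, count in gold_counts.items())
        return overlap / sum(gold_counts.values())

    # Example: ROUGE-2 (bigram) recall of a one-sentence system summary.
    print(rouge_n_recall("the storm hit the coast on monday",
                         "a severe storm hit the coast monday", n=2))

As the quoted passage notes, this kind of surface overlap is what the citing authors consider insufficient for timeline summarization, which motivates the nugget- and cluster-based alternatives compared in the paper above.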
“…of systems. As a result, it is unclear to what extent the test collections produced during these tracks can be used to evaluate the quality of new systems that were not pooled for judging [5].…”
Section: Timestamp
Mentioning, confidence: 99%
“…Second, as information clusters within the TREC Real-time Summarization track during 2016 and 2017. We choose to use the TREC Temporal Summarization implementation as the basis for the study in this paper as it is the more complex/costly to deploy of the two (due to the more fine-grained definition of atomic information units used) and because it enables a more detailed comparison of systems [5]. We discuss this implementation below.…”
Section: Timeline Summaries and Evaluation
Mentioning, confidence: 99%
“…More precisely, events 1-10 are assigned to a 'TTG-2013' label set, events 11-25 to a 'TTG-2014' label set and events 26-46 to a 'TTG-2015' label set. We use these label sets later to provide an approximate comparison of the performance of our proposed approaches to the TREC best participating systems for each year.…”
Section: 'TREC-TS-201X' and 'TTG-201X' Label Sets
Mentioning, confidence: 99%
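The quoted statement describes a simple assignment of events to per-year label sets. The sketch below only illustrates that mapping in Python, assuming the event-id ranges quoted above (1-10, 11-25, 26-46); the helper name is hypothetical and not from the citing paper.

    # Sketch of the per-year label-set split described in the quoted passage.
    # Ranges are taken from the quote; the function name is an assumption.
    def ttg_label_set(event_id):
        if 1 <= event_id <= 10:
            return "TTG-2013"
        if 11 <= event_id <= 25:
            return "TTG-2014"
        if 26 <= event_id <= 46:
            return "TTG-2015"
        raise ValueError(f"event {event_id} is outside the 46 labeled events")

    label_sets = {year: [e for e in range(1, 47) if ttg_label_set(e) == year]
                  for year in ("TTG-2013", "TTG-2014", "TTG-2015")}
    print({year: len(events) for year, events in label_sets.items()})  # sizes: 10, 15, 21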
“…In particular, the labeling methodologies used to create the nuggets and matches, the interfaces and support tools used to do the matching, as well as the assessor profiles differ between the TREC-TS original assessments ('TREC-TS-201X' label sets) and the label sets derived from 'TTG-All' ('TTG-201X' label sets). For those interested in examining the differences between these methodologies in more detail, we recommend reading the study by Baruah et al. [4].…”
Section: 'TREC-TS-201X' and 'TTG-201X' Label Sets
Mentioning, confidence: 99%