2020
DOI: 10.1016/j.ijhcs.2019.10.004
|View full text |Cite
|
Sign up to set email alerts
|

Everything you always wanted to know about a dataset: Studies in data summarisation

Abstract: Summarising data as text helps people make sense of it. It also improves data discovery, as search algorithms can match this text against keyword queries. In this paper, we explore the characteristics of text summaries of data in order to understand how meaningful summaries look like. We present two complementary studies: a data-search diary study with 69 students, which offers insight into the information needs of people searching for data; and a summarisation study, with a lab and a crowdsourcing component w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
36
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 27 publications
(37 citation statements)
references
References 79 publications
0
36
0
1
Order By: Relevance
“…Choosing a dataset greatly depends on the information provided alongside it. A number of studies indicate that standard metadata does not provide sufficient information for dataset reuse [81,106]. Recent studies have discussed textual ( [81,129]) or visual [138] surrogates of datasets that aim to help people identify relevant documents and increase accuracy and/or satisfaction with their relevance judgments.…”
Section: Results Presentationmentioning
confidence: 99%
See 4 more Smart Citations
“…Choosing a dataset greatly depends on the information provided alongside it. A number of studies indicate that standard metadata does not provide sufficient information for dataset reuse [81,106]. Recent studies have discussed textual ( [81,129]) or visual [138] surrogates of datasets that aim to help people identify relevant documents and increase accuracy and/or satisfaction with their relevance judgments.…”
Section: Results Presentationmentioning
confidence: 99%
“…Users judge the relevance of datasets for a specific task based on the dataset's scope (e.g. geographical and temporal scope) [104,75], basic statistics about the dataset such as counts and value ranges, and information about granularity of information in the data [81]. The documentation of variables and the context from which the dataset comes from also play a key role.…”
Section: Results Presentationmentioning
confidence: 99%
See 3 more Smart Citations