2021
DOI: 10.1177/08944393211012268
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking Crisis in Social Media Analytics: A Solution for the Data-Sharing Problem

Abstract: Computational social science uses computational and statistical methods in order to evaluate social interaction. The public availability of data sets is thus a necessary precondition for reliable and replicable research. These data allow researchers to benchmark the computational methods they develop, test the generalizability of their findings, and build confidence in their results. When social media data are concerned, data sharing is often restricted for legal or privacy reasons, which makes the comparison … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

4
4

Authors

Journals

citations
Cited by 15 publications
(12 citation statements)
references
References 87 publications
0
12
0
Order By: Relevance
“…The first issue – that is, the limited availability of reference datasets – can be traced back to a combination of long‐known and covid‐related challenges. Firstly, building high‐quality scientific datasets have always represented a very demanding task [ 98 ]. In addition, the impact and the recency of the pandemic left even less time and resources for scholars to tackle this task.…”
Section: Discussion: Challenges and Future Directionsmentioning
confidence: 99%
See 1 more Smart Citation
“…The first issue – that is, the limited availability of reference datasets – can be traced back to a combination of long‐known and covid‐related challenges. Firstly, building high‐quality scientific datasets have always represented a very demanding task [ 98 ]. In addition, the impact and the recency of the pandemic left even less time and resources for scholars to tackle this task.…”
Section: Discussion: Challenges and Future Directionsmentioning
confidence: 99%
“…High quality and reference datasets represent an important resource to foster research and experimentation on novel scientific issues [ 97 ]. However, building such resources is notoriously challenging and time‐demanding [ 98 ]. In the case of COVID‐19 phishing attacks, publicly available reference datasets are few and far between.…”
Section: Countermeasuresmentioning
confidence: 99%
“…For these reasons, some of the content observed by Graham and Keller was expected to be missing from our dataset. This lack of consistency between social media datasets for comparative analyses is a growing challenge recently identified in the benchmarking literature (Assenmacher et al 2021 ).…”
Section: The Data and Its Timelinementioning
confidence: 99%
“…For these reasons, some of the content observed by Graham and Keller were expected to be missing from our dataset. This lack of consistency between social media datasets for comparative analyses is a growing challenge recently identified in the benchmarking literature (Assenmacher et al, 2021). A final contrast dataset was obtained via the Real-Time Analytics Platform for Interactive Data Mining (RAPID) (Lim et al, 2018) consisting of tweets containing the term '#brexit' during the same period.…”
Section: The Dataset and Its Timelinementioning
confidence: 99%