2021
DOI: 10.1609/icwsm.v15i1.18127
|View full text |Cite
|
Sign up to set email alerts
|

Media Cloud: Massive Open Source Collection of Global News on the Open Web

Abstract: We present the first full description of Media Cloud, an open source platform based on crawling hyperlink structure in operation for over 10 years, that for many uses will be the best way to collect data for studying the media ecosystem on the open web. We document the key choices behind what data Media Cloud collects and stores, how it processes and organizes these data, and its open API access as well as user-facing tools. We also highlight the strengths and limitations of the Media Cloud collection strategy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 36 publications
0
14
0
Order By: Relevance
“…Figure 8: Dynamics of mean centrality measures in the opinion co-occurrence network for conspiracy (red lines) and non-conspiracy opinions (gray lines). The green line shows the news coverage ratios from Media Cloud (Roberts et al 2021). The highlighted area shows a spike in the news coverage, which coincides with a decrease in the centrality of conspiracy opinions.…”
Section: Centrality Dynamics In Opinion Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 8: Dynamics of mean centrality measures in the opinion co-occurrence network for conspiracy (red lines) and non-conspiracy opinions (gray lines). The green line shows the news coverage ratios from Media Cloud (Roberts et al 2021). The highlighted area shows a spike in the news coverage, which coincides with a decrease in the centrality of conspiracy opinions.…”
Section: Centrality Dynamics In Opinion Networkmentioning
confidence: 99%
“…We also depict the attention dedicated by the Australian news media to the bushfires during the same period. We estimate the latter using the news coverage ratio -the percentage of articles dedicated to the topic over all captured articles in a day -crawled using the Media Cloud (Roberts et al 2021).…”
Section: Centrality Dynamics In Opinion Networkmentioning
confidence: 99%
“…We used the an open-source platform for media analysis MediaCloud available at https://mediacloud.org/ (Roberts et al 2021), searching the geographical collection United Kingdom -National, that includes most UK media sources. We performed a Boolean search including all names of the GBD or JSM list and the terms "COVID-19 or coronavirus or pandemic" within the date range 1/1/2020-20/10/2021.…”
Section: Identification Of Uk News Articles Mentioning Gbd or Jsm Nam...mentioning
confidence: 99%
“…We used the open-source platform for media analysis MediaCloud available at https://mediacloud.org/ [35], searching the geographical collection "U.S. Top Newspapers 2018" or the collection "Italy -National" for the date range 1/1/2020-20/10/2021. In English, we performed seven independent searches of these terms referring to COVID-19 interventions: convalescent plasma, hydroxychloroquine, ivermectin, lockdown, mask, vaccine and vitamin D; these were combined in a Boolean AND search with "covid AND (published OR publication OR journal)".…”
Section: Data Collectionmentioning
confidence: 99%