2018
DOI: 10.1007/978-3-030-01159-8_30

New/s/leak 2.0 – Multilingual Information Extraction and Visualization for Investigative Journalism

Abstract: Investigative journalism in recent years is confronted with two major challenges: 1) vast amounts of unstructured data originating from large text collections such as leaks or answers to Freedom of Information requests, and 2) multi-lingual data due to intensified global cooperation and communication in politics, business and civil society. Faced with these challenges, journalists are increasingly cooperating in international networks. To support such collaborations, we present the new version of new/s/leak 2.…

citations
Cited by 7 publications
(5 citation statements)
references
References 10 publications
“…For example, netflower is a visual exploration tool that supports journalists in the analysis of quantitative data flows [16]. Similarly, the tool newsleak [17] was designed to support investigative journalists to make sense of leak data.…”
Section: Data Visualization In Journalism
mentioning
confidence: 99%
“…In investigative journalism, tools like New/s/leak 2.0 [103], as used by Der SPIEGEL, use models trained on public data like Wikipedia for discovering named entities in textual data (e.g., persons or company names). Similarly, the industry-standard spacy [48] uses public corpora and increasingly open web information for model training.…”
Section: Digital Communication Analysis and Employed Technology
mentioning
confidence: 99%
“…One of the most prevailing domains for such systems is the analysis of communications data for intelligence purposes, namely in criminal investigations, lawsuits, matters of national and international security and in investigative journalism. Specialized systems are used, for example, by the National Security Agency (NSA) as part of global spying operations [77] or law enforcement against organized crime [23], by lawyers for analyzing case-relevant documents [2], but also by journalists working on [103] large data leaks such as the Panama Papers. During these operations, large amounts of communication data, like e-mails, chats, posts, or calls are collected, along with associated documents (e.g., attachments) and meta-data like timestamps, locations, and contact networks.…”
mentioning
confidence: 99%
“…Due to the micro-service architecture, which communicates over HTTP, our NER docker container can easily be used in parallel NLP processing chains. We use it, for instance, in the information extraction pipeline of our "new/s/leak" project (Wiedemann et al., 2018), to create visualizations of co-occurrence networks of named entities.…”
Section: Micro-service
mentioning
confidence: 99%