Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016
DOI: 10.18653/v1/p16-1178
|View full text |Cite
|
Sign up to set email alerts
|

Linguistic Benchmarks of Online News Article Quality

Abstract: Online news editors ask themselves the same question many times: what is missing in this news article to go online? This is not an easy question to be answered by computational linguistic methods. In this work, we address this important question and characterise the constituents of news article editorial quality. More specifically, we identify 14 aspects related to the content of news articles. Through a correlation analysis, we quantify their independence and relation to assessing an article's editorial quali… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
16
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(17 citation statements)
references
References 26 publications
(26 reference statements)
1
16
0
Order By: Relevance
“…In the works that study bias, Arapakis et al (2016) collected a dataset of 561 news articles, each being labeled with 14 qualitative aspects along with article's subjectivity. Another dataset, the multi-perspective question answering (MPQA) corpus (Wiebe et al, 2005), contains 692 news articles, each with a label of its subjectivity.…”
Section: Political Bias Datasetsmentioning
confidence: 99%
“…In the works that study bias, Arapakis et al (2016) collected a dataset of 561 news articles, each being labeled with 14 qualitative aspects along with article's subjectivity. Another dataset, the multi-perspective question answering (MPQA) corpus (Wiebe et al, 2005), contains 692 news articles, each with a label of its subjectivity.…”
Section: Political Bias Datasetsmentioning
confidence: 99%
“…Beautiful phrasing: High-quality articles are often written using beautiful language [2,22], meaning more unusual phrasing and creative words that lead to more positive feedback [34]. We used Term Frequency-Inverse Document Frequency (tf•idf) as a proxy, because the use of rarer words can be seen as an indicator for the use of beautiful language.…”
Section: Stylementioning
confidence: 99%
“…Genre: The topic of the article reflects certain characteristics of its nature and it has been greatly investigated in previous work [2,22].…”
Section: Context Featuresmentioning
confidence: 99%
“…Bias Analysis The analysis of media bias has been a subject of investigation for decades (Groseclose and Milyo, 2005;Fang et al, 2012;Arapakis et al, 2016). Various aspects of bias have been studied from different perspectives.…”
Section: Related Workmentioning
confidence: 99%
“…Bias Datasets To study the bias in the newspaper domain, several developed corpora include one or more label types related to bias. For example, the news quality corpus created by Arapakis et al (2016) comprises 561 articles, each of which being labeled with 14 different quality aspects including article's subjectivity. Also, the MPQA corpus contains a label for the subjectivity of its 692 news articles (Wiebe et al, 2005).…”
Section: Related Workmentioning
confidence: 99%