Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia

Wong, KayYen; Реди, Мириам; Sáez-Trumper, Diego

doi:10.1145/3404835.3463253

Cited by 5 publications

(4 citation statements)

References 11 publications

(16 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Previous studies on quality estimation in Wikipedia have mainly focused on the article-level (Mola-Velasco 2011; Bykau et al 2015;Wong, Redi, and Saez-Trumper 2021;Asthana et al 2021). They are aimed at estimating the quality of revisions and articles.…”

Section: Related Workmentioning

confidence: 99%

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

Ando,

Sekine,

Komachi

2024

AAAI

View full text Add to dashboard Cite

Wikipedia can be edited by anyone and thus contains various quality sentences. Therefore, Wikipedia includes some poor-quality edits, which are often marked up by other editors. While editors' reviews enhance the credibility of Wikipedia, it is hard to check all edited text. Assisting in this process is very important, but a large and comprehensive dataset for studying it does not currently exist. Here, we propose WikiSQE, the first large-scale dataset for sentence quality estimation in Wikipedia. Each sentence is extracted from the entire revision history of English Wikipedia, and the target quality labels were carefully investigated and selected. WikiSQE has about 3.4 M sentences with 153 quality labels. In the experiment with automatic classification using competitive machine learning models, sentences that had problems with citation, syntax/semantics, or propositions were found to be more difficult to detect. In addition, by performing human annotation, we found that the model we developed performed better than the crowdsourced workers. WikiSQE is expected to be a valuable resource for other tasks in NLP.

show abstract

Section: Related Workmentioning

confidence: 99%

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

Ando,

Sekine,

Komachi

2024

AAAI

View full text Add to dashboard Cite

show abstract

“…According to the literature, the classification of wiki edits encompasses the detection of paid [20], puffery [21], reverted [22], [23], [24], toxic [25], [26] and vandal [9], [10], [12], [13], [17], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37], [38] reviews. Similarly, prediction focuses on review quality [39], [40], [41] as well as on editor and article quality [18], [42], [43], [44], [45].…”

Section: B Analysis Of Reviewsmentioning

confidence: 99%

“…In the case of article drafts, ORES returns the probability of being spam, vandalism, an attack, and OK. These scores are used as input features by many of the surveyed works, e.g., [9], [34], [36], [37], [41], to classify reviews. Moreover, ORES is currently used on wiki platforms to help volunteers reduce the burden of manually screening content.…”

Section: ) Vandalism Detectionmentioning

confidence: 99%

“…Using side-based and stylometric profiles, the solution applies a deep neural network to extract quality indicators. [41] shared an annotated data set of English Wikipedia articles based on Wikipedia templates, e.g., original research, contradictory, unreliable sources, etc. The data set was used to predict the content reliability using Logistic Regression [49], Random Forest, and Gradient Boosted Trees [50].…”

Section: ) Quality Predictionmentioning

confidence: 99%

See 1 more Smart Citation

Interpretable Classification of Wiki-Review Streams

García-Méndez,

Leal,

Malheiro

et al. 2023

IEEE Access

View full text Add to dashboard Cite

Wiki articles are created and maintained by a crowd of editors, producing a continuous stream of reviews. Reviews can take the form of additions, reverts, or both. This crowdsourcing model is exposed to manipulation since neither reviews nor editors are automatically screened and purged. To protect articles against vandalism or damage, the stream of reviews can be mined to classify reviews and profile editors in real-time. The goal of this work is to anticipate and explain which reviews to revert. This way, editors are informed why their edits will be reverted. The proposed method employs stream-based processing, updating the profiling and classification models on each incoming event. The profiling uses side and content-based features employing Natural Language Processing, and editor profiles are incrementally updated based on their reviews. Since the proposed method relies on self-explainable classification algorithms, it is possible to understand why a review has been classified as a revert or a non-revert. In addition, this work contributes an algorithm for generating synthetic data for class balancing, making the final classification fairer. The proposed online method was tested with a real data set from Wikivoyage, which was balanced through the aforementioned synthetic data generation. The results attained near-90 % values for all evaluation metrics (accuracy, precision, recall, and F-measure).INDEX TERMS Data reliability and fairness, data-stream processing and classification, synthetic data, transparency, vandalism, wikis.

show abstract

WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia

Hsu

Sáez-Trumper³

et al. 2021

2021 IEEE International Conference on Big Data (Big Data)

View full text Add to dashboard Cite

Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia

Cited by 5 publications

References 11 publications

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

WikiSQE: A Large-Scale Dataset for Sentence Quality Estimation in Wikipedia

Interpretable Classification of Wiki-Review Streams

WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia

Contact Info

Product

Resources

About