Andrei Nesterov scite author profile

Andrei Nesterov

2Publications

0Citation Statements Received

12Citation Statements Given

How they've been cited

How they cite others

Affiliations

Centrum Wiskunde & Informatica

Publications

Order By: Most citations

A Knowledge Graph of Contentious Terminology for Inclusive Representation of Cultural Heritage

Nesterov

Hollink

Erp³

et al. 2023

View full text Add to dashboard Cite

Cultural heritage collections available as linked open data (LOD) may contain harmful stereotypes about people and cultures, for example, in outdated textual descriptions of objects. Galleries, libraries, archives, and museums (GLAM) have suggested various approaches to tackle potentially problematic content in digital collections. However, the domain expertise and discussions about words and phrases used in LOD-collections are scattered across different resources and detached from the collections themselves. In this paper, we capture domain expertise about English and Dutch contentious heritage terminology in a knowledge graph. Contentious terms in the resulting graph are then linked to entities from other LOD-resources used in the cultural domain and beyond, including Wikidata and WordNet. We make our design decisions explicit and report on the linking process. The developed knowledge graph makes expert knowledge interoperable, so it can be reused by the cultural heritage community and other LOD-developers to contribute to a more inclusive representation of cultural heritage on the Web.

show abstract

Capturing Contentiousness

Brate¹,

Nesterov

Vogelmann³

et al. 2021

View full text Add to dashboard Cite

Recent initiatives by cultural heritage institutions in addressing outdated and offensive language used in their collections demonstrate the need for further understanding into when terms are problematic or contentious. This paper presents an annotated dataset of 2,715 unique samples of terms in context, drawn from a historical newspaper archive, collating 21,800 annotations of contentiousness from expert and crowd workers.We describe the contents of the corpus by analysing inter-rater agreement and differences between experts and crowd workers. In addition, we demonstrate the potential of the corpus for automated detection of contentiousness. We show that a simple classifier applied to the embedding representation of a target word provides a better than baseline performance in predicting contentiousness. We find that the term itself and the context play a role in whether a term is considered contentious. CCS CONCEPTS• Information systems → Digital libraries and archives; • Computing methodologies → Knowledge representation and reasoning.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Andrei Nesterov

A Knowledge Graph of Contentious Terminology for Inclusive Representation of Cultural Heritage

Capturing Contentiousness

Contact Info

Product

Resources

About