Researchers Gone Wild

Harvey, Adam; LaPlace, Jules

doi:10.14361/9783839457603-016

Cited by 5 publications

(12 citation statements)

References 6 publications


“…The original paper gave no specific motivating applications, but did note that "Recognizing celebrities, rather than a pre-selected private group of people, represents public interest and could be directly applied to a wide range of real scenarios." Researchers and journalists noted in 2019 that many of the "celebrities" were in fact fairly ordinary citizens, and that the images were aggregated without consent [36,59]. Several corporations tied to mass surveillance operations were also found to use the dataset in research papers [36,59].…”

Section: Overview of the Datasets and Analysis (mentioning)

confidence: 99%

“…Researchers and journalists noted in 2019 that many of the "celebrities" were in fact fairly ordinary citizens, and that the images were aggregated without consent [36,59]. Several corporations tied to mass surveillance operations were also found to use the dataset in research papers [36,59]. The dataset was taken down in June 2019.…”

Section: Overview of the Datasets and Analysis (mentioning)

confidence: 99%

“…The images were collected from video footage taken on Duke's campus. The same reports on MS-Celeb-1M listed above [36,59] noted that the DukeMTMC was also being used by corporations tied to mass surveillance operations, and also noted the lack of consent given by people included in the dataset. The creators removed the dataset in June 2019, apologizing, noting that they had inadvertently broken guidelines provided by the Duke University IRB.…”

Section: Overview of the Datasets and Analysis (mentioning)

confidence: 99%

“…When datasets are deemed problematic by the machine learning community, activists, or the media, dataset creators have responded by retracting them. MS-Celeb-1M [32], DukeMTMC [65], VGGFace [60], and Brainwash [69] were all retracted after an investigation by Harvey and Laplace [36] that pointed out ethical concerns with how the data was collected and being used. TinyImages [72] was retracted following a report by Prabhu and Birhane [62] that raised ethical concerns involving offensive labels in the dataset.…”

Section: Retractions and Runaway Data (mentioning)

confidence: 99%

“…Datasets play an essential role in machine learning research, but are also the source of ethical concerns. These include concerns about the privacy of individuals included in datasets [36,62], representational harms introduced by annotations [20,35], effects of biases in datasets on downstream use [16,17,15], and the use of datasets for ethically dubious purposes [36,67,59]. These concerns have led to the retractions of several prominent research datasets including Tiny Images, VGGFace2, DukeMTMC, and MS-Celeb-1M.…”

Section: Introduction (mentioning)

confidence: 99%


Mitigating Dataset Harms Requires Stewardship: Lessons from 1000 Papers

Peng, Mathur, Narayanan

2021

Preprint


Concerns about privacy, bias, and harmful applications have shone a light on the ethics of machine learning datasets, even leading to the retraction of prominent datasets including DukeMTMC, MS-Celeb-1M, TinyImages, and VGGFace2. In response, the machine learning community has called for higher ethical standards, transparency efforts, and technical fixes in the dataset creation process. The premise of our work is that these efforts can be more effective if informed by an understanding of how datasets are used in practice in the research community. We study three influential face and person recognition datasets, namely DukeMTMC, MS-Celeb-1M, and Labeled Faces in the Wild (LFW), by analyzing nearly 1000 papers that cite them. We found that the creation of derivative datasets and models, broader technological and social change, the lack of clarity of licenses, and dataset management practices can introduce a wide range of ethical concerns. We conclude by suggesting a distributed approach that can mitigate these harms, making recommendations to dataset creators, conference program committees, dataset users, and the broader research community.


Real Risks of Fake Data: Synthetic Data, Diversity-Washing and Consent Circumvention

Whitney, Norman

2024

The 2024 ACM Conference on Fairness, Accountability, and Transparency


The unbearable (technical) unreliability of automated facial emotion recognition

2022


Emotion recognition, and in particular facial emotion recognition (FER), is among the most controversial applications of machine learning, not least because of its ethical implications for human subjects. In this article, we address the controversial conjecture that machines can read emotions from our facial expressions by asking whether this task can be performed reliably. This means, rather than considering the potential harms or scientific soundness of facial emotion recognition systems, focusing on the reliability of the ground truths used to develop emotion recognition systems, assessing how well different human observers agree on the emotions they detect in subjects’ faces. Additionally, we discuss the extent to which sharing context can help observers agree on the emotions they perceive on subjects’ faces. Briefly, we demonstrate that when large and heterogeneous samples of observers are involved, the task of emotion detection from static images crumbles into inconsistency. We thus reveal that any endeavour to understand human behaviour from large sets of labelled patterns is over-ambitious, even if it were technically feasible. We conclude that we cannot speak of actual accuracy for facial emotion recognition systems for any practical purposes.

