2020
DOI: 10.1609/icwsm.v14i1.7282
Feature-Based Explanations Don't Help People Detect Misclassifications of Online Toxicity

Abstract: We present an experimental assessment of the impact of feature attribution-style explanations on human performance in predicting the consensus toxicity of social media posts with advice from an unreliable machine learning model. By doing so we add to a small but growing body of literature inspecting the utility of interpretable machine learning in terms of human outcomes. We also evaluate interpretable machine learning for the first time in the important domain of online toxicity, where fully-automated methods…
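To make the paper's central notion concrete, a feature attribution-style explanation can be sketched as the per-word contribution scores of a simple text classifier, shown alongside its toxicity prediction. The sketch below is a hypothetical illustration under that assumption; the model, data, and function names are invented and are not those used in the paper.

```python
# Minimal sketch of a feature attribution-style explanation for a toxicity
# classifier: per-word contributions from a bag-of-words logistic regression.
# Hypothetical illustration only -- not the model, data, or method from the paper.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Tiny invented training set: 1 = toxic, 0 = non-toxic.
posts = ["you are an idiot", "have a nice day",
         "what an idiot move", "nice work everyone"]
labels = [1, 0, 1, 0]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(posts)
model = LogisticRegression().fit(X, labels)

def explain(post):
    """Return (word, contribution) pairs: word count times its learned weight."""
    counts = vectorizer.transform([post])
    weights = model.coef_[0]
    return sorted(
        [(word, counts[0, idx] * weights[idx])
         for word, idx in vectorizer.vocabulary_.items()
         if counts[0, idx] > 0],
        key=lambda pair: pair[1], reverse=True)

print(model.predict(vectorizer.transform(["you are an idiot"])))  # predicted label
print(explain("you are an idiot"))  # "idiot" carries the largest positive score
```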

Cited by 31 publications (9 citation statements). References 32 publications.
“…Withholding predictions of an uncalibrated model may improve decision quality. Consistent with prior work on AI-advised decision-making (e.g., [7,10,21,38]), our results suggest that when a model is well-calibrated and more accurate than humans alone, users with access to its predictions can perform better than without the model, but not as well as the model alone for easier instances. When the model is poorly calibrated, the type of prediction display affects whether people can perform better by accessing the model predictions.…”
Section: Discussion (supporting)
confidence: 86%
“…In recent years, there has been a surge of research in human-AI decision-making, with a growing number of studies conducting behavioral experiments to gain a better understanding of how humans form decisions in the presence of AI (Alufaisan et al. n.d.; Buçinca et al. 2020; Carton et al. 2020; Lai et al. 2020; Lai and Tan 2019; Liu et al. 2021; Zhang et al. 2020). This research has focused on improving human-AI decision-making to optimize team performance (Buçinca et al. 2020; Zhang et al. 2020).…”
Section: Human-AI Decision-Making and Appropriate Reliance (mentioning)
confidence: 99%
“…Thus, the AI model often provides the confidence level of the decision [51,75] or an additional explanation for its decision [1,42]. Several works have evaluated whether different types of explanations can support humans' understanding of the AI model so that they identify the right cases to rely on the recommendations [4,13,15,69]. Explanations can lead people to rely too much on the decision of the AI model, particularly when its suggestion is incorrect [6].…”
Section: Related Work (mentioning)
confidence: 99%