2023
DOI: 10.48550/arxiv.2302.07248
Preprint

Generation Probabilities Are Not Enough: Exploring the Effectiveness of Uncertainty Highlighting in AI-Powered Code Completions

Abstract: Large-scale generative models enabled the development of AI-powered code completion tools to assist programmers in writing code. However, much like other AI-powered tools, AI-powered code completions are not always accurate, potentially introducing bugs or even security vulnerabilities into code if not properly detected and corrected by a human programmer. One technique that has been proposed and implemented to help programmers identify potential errors is to highlight uncertain tokens. However, there have bee…

Cited by 3 publications (5 citation statements)
References 33 publications
“…What is a useful notion of uncertainty for LLMs? While LLMs have a notion of uncertainty baked into them, namely the likelihood that the model would generate a specific token given its preceding or surrounding context (Bengio et al., 2003), what we have referred to in past work as the generation probability (Vasconcelos et al., 2023), whether this notion would be useful to different stakeholders is questionable. In particular, this notion may not line up with people's intuition about what it means for the model to be uncertain.…”
Section: Communicating Uncertainty (mentioning)
confidence: 99%
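The "generation probability" referred to in this statement is the per-token likelihood a causal language model assigns to each token of a completion, given the context before it. The following is a minimal sketch of that idea, not the paper's implementation; the model choice, prompt, and completion below are illustrative and assume the Hugging Face transformers library.

```python
# Sketch: per-token generation probabilities for a code completion,
# using an off-the-shelf causal LM (model choice is illustrative).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = "def add(a, b):\n    "      # context written by the programmer
completion = "return a + b"          # completion proposed by the model

prompt_ids = tokenizer(prompt, return_tensors="pt").input_ids
completion_ids = tokenizer(completion, return_tensors="pt").input_ids
input_ids = torch.cat([prompt_ids, completion_ids], dim=1)

with torch.no_grad():
    logits = model(input_ids).logits  # shape: (1, seq_len, vocab_size)

# Logits at position i predict the token at position i + 1.
probs = torch.softmax(logits[0, :-1], dim=-1)
start = prompt_ids.shape[1] - 1
for offset, token_id in enumerate(completion_ids[0]):
    p = probs[start + offset, token_id].item()
    print(f"{tokenizer.decode(int(token_id))!r}: generation probability = {p:.3f}")
```

Each printed value is the probability the model assigned to that completion token given everything preceding it; the highlighting technique discussed in the paper treats low values as candidates for visual emphasis.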
“…Carefully selecting a notion of uncertainty to convey to stakeholders matters because the particular notion used impacts their behavior and trust. In our recent work with collaborators (Vasconcelos et al., 2023), this means considering what notion of uncertainty is conveyed and in what form it is provided (e.g., its precision and modality), and what the effect is (e.g., on trust or behaviors), as well as taking into consideration the characteristics of the receiver (Van Der Bles et al., 2019). For example, in our study on uncertainty in the context of code completion tools (Vasconcelos et al., 2023), by soliciting participants' feedback on different uncertainty communication design choices, we found that programmers prefer uncertainty about granular or meaningful blocks to guide them to make token-level changes, and prefer less precise communication (as opposed to exact quantification) for easy processing, both ultimately supporting their goal of producing correct code efficiently.…”
Section: Communicating Uncertainty (mentioning)
confidence: 99%
“…What is a useful notion of uncertainty for LLMs? While LLMs have a notion of uncertainty baked into them, namely the likelihood that the model would generate a specific token given its preceding or surrounding context [17], what we have referred to in past work as the generation probability [178], whether this notion would be useful to different stakeholders is questionable. In particular, this notion may not line up with people's intuition about what it means for the model to be uncertain.…”
Section: Communicating Uncertainty (mentioning)
confidence: 99%
“…Carefully selecting a notion of uncertainty to convey to stakeholders matters because the particular notion used impacts their behavior and trust. In our recent work with collaborators [178], we explored the effectiveness of displaying two alternative notions of uncertainty to programmers interacting with an LLM-powered code completion tool. In a mixed-methods study with 30 programmers, we compared three conditions: providing a code completion alone, highlighting those tokens with the lowest likelihood of being generated by the underlying LLM (i.e., lowest generation probability), and highlighting tokens with the highest predicted likelihood of being edited by a programmer according to a separate "edit model" trained on logged data from past programmer interactions.…”
Section: Communicating Uncertainty (mentioning)
confidence: 99%
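For concreteness, the two highlighting conditions described in this statement (lowest generation probability versus a separate "edit model") can be read as two token-selection rules. The sketch below is hypothetical: the per-token scores, the threshold, and the edit_scorer stand-in are illustrative assumptions, not the study's actual models or data.

```python
# Sketch: two ways of choosing which completion tokens to highlight.
from typing import Callable, List, Set

def highlight_by_generation_probability(tokens: List[str],
                                        gen_probs: List[float],
                                        k: int = 3) -> Set[int]:
    """Indices of the k tokens the underlying LLM was least confident about."""
    ranked = sorted(range(len(tokens)), key=lambda i: gen_probs[i])
    return set(ranked[:k])

def highlight_by_edit_model(tokens: List[str],
                            edit_scorer: Callable[[List[str], int], float],
                            threshold: float = 0.5) -> Set[int]:
    """Indices whose predicted probability of being edited exceeds a threshold."""
    return {i for i in range(len(tokens)) if edit_scorer(tokens, i) > threshold}

# Hypothetical numbers: the least-likely tokens are not necessarily the ones a
# programmer would actually edit, which is the distinction the study probes.
tokens = ["return", " a", " +", " b"]
gen_probs = [0.92, 0.41, 0.88, 0.35]                           # stand-in per-token probabilities
edit_scorer = lambda toks, i: 0.7 if toks[i] == " b" else 0.1  # stand-in "edit model"

print(highlight_by_generation_probability(tokens, gen_probs, k=2))  # {1, 3}
print(highlight_by_edit_model(tokens, edit_scorer))                 # {3}
```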