Lukas Finnveden scite author profile

Lukas Finnveden

2Publications

7Citation Statements Received

107Citation Statements Given

How they've been cited

How they cite others

107

Affiliations

KTH Royal Institute of Technology

Publications

Order By: Most citations

Truthful AI: Developing and governing AI that does not lie

Evans¹,

Cotton-Barratt²,

Finnveden³

et al. 2021

Preprint

View full text Add to dashboard Cite

In many contexts, lying -the use of verbal falsehoods to deceive -is harmful. While lying has traditionally been a human affair, AI systems that make sophisticated verbal statements are becoming increasingly prevalent. This raises the question of how we should limit the harm caused by AI "lies" (i.e. falsehoods that are actively selected for). Human truthfulness is governed by social norms and by laws (against defamation, perjury, and fraud). Differences between AI and humans present an opportunity to have more precise standards of truthfulness for AI, and to have these standards rise over time. This could provide significant benefits to public epistemics and the economy, and mitigate risks of worst-case AI futures.Establishing norms or laws of AI truthfulness will require significant work to:1. identify clear truthfulness standards; 2. create institutions that can judge adherence to those standards; and 3. develop AI systems that are robustly truthful.Our initial proposals for these areas include:1. a standard of avoiding "negligent falsehoods" (a generalisation of lies that is easier to assess);2. institutions to evaluate AI systems before and after real-world deployment;3. explicitly training AI systems to be truthful via curated datasets and human interaction.A concerning possibility is that evaluation mechanisms for eventual truthfulness standards could be captured by political interests, leading to harmful censorship and propaganda. Avoiding this might take careful attention. And since the scale of AI speech acts might grow dramatically over the coming decades, early truthfulness standards might be particularly important because of the precedents they set.

show abstract

Understanding when spatial transformer networks do not support invariance, and what to do about it

Finnveden

Jansson

Lindeberg

2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lukas Finnveden

Truthful AI: Developing and governing AI that does not lie

Understanding when spatial transformer networks do not support invariance, and what to do about it

Contact Info

Product

Resources

About