2017
DOI: 10.1007/978-3-662-54033-6_5
|View full text |Cite
|
Sign up to set email alerts
|

Agent Foundations for Aligning Machine Intelligence with Human Interests: A Technical Research Agenda

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
64
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 59 publications
(69 citation statements)
references
References 17 publications
0
64
0
Order By: Relevance
“…This section will discuss some issues that nonetheless arise, and ways in which those issues can potentially be addressed. For more comprehensive overviews of safety concerns of intelligent agents, see [4,21,76,83]. …”
Section: Predicting and Controlling Behaviourmentioning
confidence: 99%
“…This section will discuss some issues that nonetheless arise, and ways in which those issues can potentially be addressed. For more comprehensive overviews of safety concerns of intelligent agents, see [4,21,76,83]. …”
Section: Predicting and Controlling Behaviourmentioning
confidence: 99%
“…The question of how to model agents as an ordinary part of the environment is of interest in the speculative study of human-level and smarter-than-human artificial intelligence [13,14]. Although such systems are still firmly in the domain of futurism, there has been a recent wave of interest in foundational research aimed at understanding their behavior, in order to ensure that they will behave as intended if and when they are developed [15,16,14].…”
Section: Related Workmentioning
confidence: 99%
“…$15.00 https://doi.org /10.1145/3375627.3375872 Algorithmic bias fits into a larger body of concerns dealing with the integration and performance of technology artifacts in society. The problem of value alignment [2,15,22] captures this larger body of concerns: how can we design systems that achieve their specific objectives while remaining aligned with the broader values of society? The values implicated can include fairness, privacy, safety, trustworthiness, etc.…”
Section: Introductionmentioning
confidence: 99%