2019
DOI: 10.1609/aaai.v33i01.33019755

Designing Preferences, Beliefs, and Identities for Artificial Intelligence

Abstract: Research in artificial intelligence, as well as in economics and other related fields, generally proceeds from the premise that each agent has a well-defined identity, well-defined preferences over outcomes, and well-defined beliefs about the world. However, as we design AI systems, we in fact need to specify where the boundaries between one agent and another in the system lie, what objective functions these agents aim to maximize, and to some extent even what belief formation processes they use. The pre…

Cited by 8 publications (4 citation statements)
References 20 publications

Citation statements:
“…More recently, much work in AI alignment has fallen under the embedded agency paradigm. The process of understanding optimal and predictable behavior for agents embedded inside of an environment is complicated by conceptual challenges involving an agent's identity and world model [22,24]. Progress has been made through the formulation of Functional Decision Theory [68,18] which offers a framework for understanding optimal behavior in terms of having an optimal policy as opposed to making optimal choices.…”
Section: Related Work
confidence: 99%
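
The contrast that the statement above attributes to Functional Decision Theory, evaluating whole policies rather than isolated choices, can be made concrete with a Newcomb-style toy problem. The sketch below is purely illustrative and not drawn from the cited works; the payoff amounts and the assumption of a perfectly reliable predictor are stipulated for the example.

    # Toy Newcomb-style setup (illustrative only; not from the cited papers).
    # A predictor puts 1,000,000 in an opaque box iff it predicts the agent's
    # policy is to take only that box; taking the transparent box adds 1,000.

    def payoff(predicted_one_boxing: bool, takes_only_opaque_box: bool) -> int:
        opaque = 1_000_000 if predicted_one_boxing else 0
        bonus = 0 if takes_only_opaque_box else 1_000
        return opaque + bonus

    # Act-level view: holding the prediction fixed, two-boxing is better by 1,000.
    for predicted in (True, False):
        assert payoff(predicted, False) == payoff(predicted, True) + 1_000

    # Policy-level view: the (assumed reliable) predictor tracks the policy
    # itself, so the comparison is between whole policies rather than acts.
    print("one-boxing policy:", payoff(True, True))     # 1000000
    print("two-boxing policy:", payoff(False, False))   # 1000

The act-level comparison favors taking both boxes, while the policy-level comparison favors one-boxing; that gap is the sense in which optimizing the policy can differ from optimizing each choice.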
“…Belief propagation has been well studied for a long time especially by traditional methods [9,13]. Actually, the concept of belief propagation has also been exploited by various deep networks.…”
Section: Deep Decision Trees/Forests
confidence: 99%
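
For context on the quoted statement, the sketch below shows classical sum-product belief propagation on a tiny chain of three binary variables. It is a generic textbook illustration with made-up unary and pairwise potentials, not code from the cited references or from the deep networks discussed there.

    import numpy as np

    # Chain x1 - x2 - x3 of binary variables with unary potentials phi and a
    # shared pairwise potential psi (all numbers are arbitrary for illustration).
    phi = [np.array([0.7, 0.3]),
           np.array([0.5, 0.5]),
           np.array([0.2, 0.8])]
    psi = np.array([[1.0, 0.5],
                    [0.5, 1.0]])

    # Messages passed left-to-right (fwd) and right-to-left (bwd) along the chain.
    fwd = [np.ones(2) for _ in range(3)]
    bwd = [np.ones(2) for _ in range(3)]
    for i in range(1, 3):                      # forward pass
        fwd[i] = psi.T @ (phi[i - 1] * fwd[i - 1])
    for i in range(1, -1, -1):                 # backward pass
        bwd[i] = psi @ (phi[i + 1] * bwd[i + 1])

    # Node beliefs: local potential times incoming messages, normalized.
    # On a tree-structured graph like this chain, these are the exact marginals.
    for i in range(3):
        b = phi[i] * fwd[i] * bwd[i]
        print(f"belief at x{i + 1}:", b / b.sum())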
“…It is certainly not a novel observation that we need to think carefully about what it is that we are trying to optimize. Just to give a couple of recent examples, Conitzer (2019) discusses the importance of appropriately designing preferences and optimization goals for AI agents, while a core argument of O'Neil and Gunn (2020) is that many of the problems of "near-term AI" (defined as expert systems that replace human decision-makers) are driven by a mismatch between the performance metrics of the AI (constructed by the algorithm designers) and the true objectives of stakeholders. Nevertheless, it is useful to get a sense of where the academic community has gone in response to these concerns.…”
Section: Introduction
confidence: 99%