2022
DOI: 10.1609/aaai.v36i7.20743

Curiosity-Driven Exploration via Latent Bayesian Surprise

Abstract: The human intrinsic desire to pursue knowledge, also known as curiosity, is considered essential in the process of skill acquisition. With the aid of artificial curiosity, we could equip current techniques for control, such as Reinforcement Learning, with more natural exploration capabilities. A promising approach in this respect has consisted of using Bayesian surprise on model parameters, i.e. a metric for the difference between prior and posterior beliefs, to favour exploration. In this contribution, we pro…
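
The Bayesian surprise described in the abstract, the divergence between prior and posterior beliefs used as an intrinsic reward, can be illustrated with a minimal sketch. The snippet below assumes diagonal-Gaussian prior and posterior beliefs over a latent state and uses PyTorch; the function name, tensor shapes, and the Gaussian assumption are illustrative and not the paper's exact model.

```python
import torch
import torch.distributions as td

def bayesian_surprise_reward(prior_mean, prior_std, post_mean, post_std):
    """Intrinsic reward as the KL divergence between a diagonal-Gaussian
    posterior belief and the corresponding prior belief.

    All arguments have shape (batch, latent_dim). The reward is large when
    the new observation shifts the agent's belief far from its prediction.
    """
    prior = td.Independent(td.Normal(prior_mean, prior_std), 1)
    posterior = td.Independent(td.Normal(post_mean, post_std), 1)
    return td.kl_divergence(posterior, prior)  # shape: (batch,)

# Example: rewards for a batch of two transitions with 3-dimensional beliefs
mu_prior, std_prior = torch.zeros(2, 3), torch.ones(2, 3)
mu_post, std_post = torch.full((2, 3), 0.5), torch.full((2, 3), 0.8)
intrinsic_reward = bayesian_surprise_reward(mu_prior, std_prior, mu_post, std_post)
```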

Cited by 9 publications (5 citation statements)
References 22 publications (31 reference statements)

“…Studies exploring emotion in AI seldom consider both robust psychological and neurophysiological bases for its formulation, and fewer still employ it as a driver of learning. Regardless, our work relates to literature such as [21], wherein Mazzaglia et al. developed a latent dynamics model endowed with Bayesian surprise, the dissimilarity between its posterior and prior beliefs, which is used to reward exploration. Schillaci et al. also presented a process for estimating the change in prediction error (PE) as a metric of learning progress [26].…”
Section: Discussion
confidence: 95%
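
Alongside the Bayesian-surprise reward, the excerpt above mentions estimating the change in prediction error as a metric of learning progress. Below is a minimal, generic sketch of that idea: progress is measured as the drop in mean prediction error between two consecutive sliding windows. The class name and windowing scheme are illustrative assumptions, not the estimator from the cited work.

```python
from collections import deque

class PredictionErrorProgress:
    """Learning progress as the decrease in mean prediction error between
    two consecutive sliding windows of recent model predictions."""

    def __init__(self, window: int = 50):
        self.window = window
        self.errors = deque(maxlen=2 * window)

    def update(self, prediction_error: float) -> float:
        """Record a new prediction error and return the current progress."""
        self.errors.append(prediction_error)
        if len(self.errors) < 2 * self.window:
            return 0.0  # not enough history to compare two windows yet
        older = list(self.errors)[: self.window]
        recent = list(self.errors)[self.window :]
        # Positive when errors are shrinking, i.e. the model is improving.
        return sum(older) / self.window - sum(recent) / self.window
```
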
“…KB intrinsic motivations derive from comparisons between pre-existing and newly acquired information and are well-documented in the animal world (e.g., [1,33,34,3,35,6,5,36,37,8]). In artificial agents, KB signals of novelty, diversity, or prediction error broaden exploration when provided alongside extrinsic rewards [13,15,38,16,39] or even on their own ([14,11], see [40] for review). However, human curiosity is often driven towards stimuli of intermediate complexity [41,42,7], rather than extremes.…”
Section: Related Work
confidence: 99%
“…State-driven Exploration. Maximizing mutual information between states and observations has also been studied in RL for exploration, using the Bayesian surprise signal given by the KL divergence between the (autoencoding) posterior and the prior of the model as a reward [51]. Alternatively, the surprisal with respect to future observations has also been used in RL to generate an intrinsic motivation signal that rewards exploration [148,52,149].…”
Section: Epistemics, Exploration and Ambiguity
confidence: 99%
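
The excerpt above contrasts the KL-based Bayesian surprise reward with surprisal over future observations. A minimal sketch of the latter is shown below, assuming the world model outputs a diagonal-Gaussian predictive distribution over the next observation; the names and the Gaussian assumption are illustrative rather than taken from the cited works.

```python
import torch
import torch.distributions as td

def surprisal_reward(pred_mean, pred_std, next_obs):
    """Intrinsic reward as the surprisal (negative log-likelihood) of the
    observed next state under the model's predictive distribution.

    pred_mean, pred_std, next_obs: tensors of shape (batch, obs_dim).
    """
    predictive = td.Independent(td.Normal(pred_mean, pred_std), 1)
    # High reward where the prediction assigns low probability to what
    # actually happened, steering the agent toward poorly modelled states.
    return -predictive.log_prob(next_obs)  # shape: (batch,)
```
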
“…In deep learning, generative models have been widely studied, obtaining outstanding results in several domains, such as image generation [38,39,40], text prediction [41,42,43], and video modeling [44,45,46,47]. In particular, temporal deep generative models that predict the dynamics of a system, i.e., the environment or world, have been studied for control [48,49,50], curiosity and exploration [51,52,53], and anomaly detection [54]. Several of these models have been used in settings similar to active inference, and some even share similarities with the active inference objective of minimizing variational free energy.…”
Section: Introduction
confidence: 99%