To enable a reinforcement learning agent to acquire symbolic knowledge, we augment it with a high-level knowledge representation. This representation consists of ordinal conditional functions (OCF), which allow the agent to rank world models. In this way, the agent can complement the self-organizing capabilities of the low-level reinforcement learning sub-system with the reasoning capabilities of a high-level learning component. We briefly summarize the state-of-the-art method for incorporating new information into the OCF. To improve the emergence of plausible behavior, we then introduce a modification of this method. The viability of this modification is examined first for the inclusion of conditional information with negated consequents and second for the generalization of belief in the context of unobserved variables. Besides providing a theoretical justification for this modification, we also demonstrate the advantages of our approach over the state-of-the-art revision method in a reinforcement learning application.
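To make the core idea concrete, the following is a minimal illustrative sketch (with a hypothetical example, not taken from the paper) of an ordinal conditional function: a mapping that assigns each possible world a non-negative integer rank, where a lower rank means the world is more plausible and at least one world must have rank 0.

```python
# Sketch of an ordinal conditional function (OCF), also called a
# ranking function. Worlds and ranks here are illustrative assumptions.
kappa = {
    ("bird", "flies"): 0,       # maximally plausible world
    ("bird", "not_flies"): 1,   # less plausible
    ("penguin", "flies"): 2,    # implausible
    ("penguin", "not_flies"): 0,
}

def rank(worlds, ocf):
    """Rank of a set of worlds = the minimum rank of its members."""
    return min(ocf[w] for w in worlds)

# Normalization condition: some world must have rank 0.
assert min(kappa.values()) == 0

# The set of penguin-worlds as a whole is considered plausible,
# because its most plausible member has rank 0.
print(rank({("penguin", "flies"), ("penguin", "not_flies")}, kappa))
```

Belief revision methods such as the one discussed in the paper then operate by shifting these ranks when new (conditional) information arrives.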
NOTATION AND TERMINOLOGY

A variable a can represent a value from its domain D_a. Such a domain consists of discrete values. One such realization of a variable is called a literal. We write literals by denoting the variable as a subscript of its value (e.g., 3_a or t_a). A formula consists of literals and logical operators such as ∧, ∨, ⇒, etc. It is
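The notation above can be mirrored in code. The following is a small sketch (domain values and the `literal` helper are illustrative assumptions, not part of the paper) representing variables, their discrete domains, and literals as plain data:

```python
# Each variable maps to its domain, a set of discrete values.
domains = {
    "a": {1, 2, 3, "t"},
}

def literal(value, variable):
    """A literal fixes a variable to one value from its domain.
    Mirrors the paper's subscript notation: 3_a -> (3, 'a')."""
    assert value in domains[variable], "value must come from the variable's domain"
    return (value, variable)

lit = literal(3, "a")
print(lit)  # → (3, 'a')
```

A formula would then combine such literals with logical connectives; here only the literal level is sketched.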