Integrated learning for interactive synthetic characters

Blumberg, Bruce; Downie, Marc; Ivanov, Yuri; Berlin, Matt; Johnson, Michael P.; Tomlinson, Bill

doi:10.1145/566654.566597

Cited by 81 publications

(77 citation statements)

References 8 publications


“…The cognitive and learning system extends the C5M architecture [Blumberg02]. The Perception and Belief Systems are most relevant to the learning abilities described in this paper.…”

Section: Robot Platform (mentioning)

confidence: 99%

“…For example, many prior works have given a human trainer control a reinforcement learner's reward [Blumberg02,Kaplan02,Saksida98], allow a human to provide advice [Clouse92,Maclin05], or have the human tele-operate the agent during training [Smart02]. Exploration approaches have the benefit that learning does not require the human's undivided attention.…”

Section: Introduction (mentioning)

confidence: 99%


Experiments in socially guided exploration: lessons learned in building robots that learn with and without human teachers

Thomaz; Breazeal. 2008. Connection Science.

We present a learning system, Socially Guided Exploration, in which a social robot learns new tasks through a combination of self-exploration and interpersonal interaction. The system's motivational drives (novelty, mastery), along with social scaffolding from a human partner, bias behavior to create learning opportunities for a Reinforcement Learning mechanism. The robot is able to learn on its own, but can flexibly use the guidance of a human teacher to improve performance. We report the results of a series of experiments where the robot learns on its own in addition to being taught by human subjects. We analyze these interactions to understand human teaching behavior and the social dynamics of the human-teacher/robot-learner system. With respect to learning performance, human guidance results in a task set that is significantly more focused and efficient, while self-exploration results in a broader set. Analysis of human teaching behavior reveals insights of social coupling between human teacher and robot learner, different teaching styles, strong consistency in the kinds and frequency of scaffolding acts across teachers, and nuance in the communicative intent behind positive and negative feedback.


“…Kaplan et al [13] and Blumberg et al [14] respectively implement clicker training on a robotic and a simulated dog. Blumberg et al's system is especially interesting, allowing the dog to learn multi-action sequences and associate them with verbal cues.…”

Section: Extracting Reward Signal From a Human (mentioning)

confidence: 99%

TAMER: Training an Agent Manually via Evaluative Reinforcement

Knox; Stone. 2008. 2008 7th IEEE International Conference on Development and Learning.

Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is desirable for a human to be able to transmit knowledge to a learning agent as quickly and effortlessly as possible, and, ideally, without any knowledge of the details of the agent's learning process. This paper proposes a general framework called Training an Agent Manually via Evaluative Reinforcement (TAMER) that allows a human to train a learning agent to perform a common class of complex tasks simply by giving scalar reward signals in response to the agent's observed actions. Specifically, in sequential decision making tasks, an agent models the human's reward function and chooses actions that it predicts will receive the most reward. Our novel algorithm is fully implemented and tested on the game Tetris. Leveraging the human trainers' feedback, the agent learns to clear an average of more than 50 lines by its third game, an order of magnitude faster than the best autonomous learning agents.


“…Through reinforcement learning, the virtual dog in [3] gradually learns to perform new behaviors. Many other works also proposed similar ways to animate virtual characters [12,32,6].…”

Section: Related Work (mentioning)

confidence: 99%

“…Unfortunately, behavior modeling has long been a difficult and cumbersome task. To enable nonprogrammers to create such "behavioral animation" [3,27,32] more easily, a large amount of work have been proposed in the research community [14,25], as well as in the commercial domain 1 . These techniques, although attempted to be user-friendly and powerful, still ask for "literate users" who are familiar with finite state machines.…”

Section: Introduction (mentioning)

confidence: 99%

Toward gesture-based behavior authoring

Shen; Chen. 2005. Computer Graphics International 2005.

Creating lifelike, autonomous, and interactive virtual behaviors is important in generating character animation, such as animal crowds, pedestrians, battle scenes, etc. Unfortunately, such a task has long been limited to skilled users, since the authoring tools, including script languages and other commercial programs, mostly require a lengthy prelearning process or are difficult to use. A novel approach, gesture-based behavior authoring, is proposed to open the interesting experience of creating autonomous animated characters to novice users. The technique enables users to efficiently prototype behaviors of a character, with the potential for further refinements. With several testees, our gesture-based authoring manner has been verified to be beneficial to the addressed problem, and, grounded on the HCI (Human-Computer Interaction) literature, the authoring process is direct, easy, and enjoyable.
