Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems 2007
DOI: 10.1145/1329125.1329367
|View full text |Cite
|
Sign up to set email alerts
|

A globally optimal algorithm for TTD-MDPs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…MDPs were first used in the context of experience management by the Declarative Optimization-based Drama Management (DODM) system (Nelson et al 2006a;2006b), which built on the search-based drama management pioneered by the Oz Project (Bates 1992;Weyhrauch 1997). MDPs serve as the basis for targeted trajectory distribution markov decision processes (TTD-MDPs) (Roberts et al 2006;Bhat et al 2007;Cantino, Roberts, and Isbell 2007;Roberts, Cantino, and Isbell Jr. 2007;). Created to provide replayability in the context of experience management, TTD-MDPs model desired distributions over outcomes and trajectories instead of maximizing expected reward.…”
Section: Related Workmentioning
confidence: 99%
“…MDPs were first used in the context of experience management by the Declarative Optimization-based Drama Management (DODM) system (Nelson et al 2006a;2006b), which built on the search-based drama management pioneered by the Oz Project (Bates 1992;Weyhrauch 1997). MDPs serve as the basis for targeted trajectory distribution markov decision processes (TTD-MDPs) (Roberts et al 2006;Bhat et al 2007;Cantino, Roberts, and Isbell 2007;Roberts, Cantino, and Isbell Jr. 2007;). Created to provide replayability in the context of experience management, TTD-MDPs model desired distributions over outcomes and trajectories instead of maximizing expected reward.…”
Section: Related Workmentioning
confidence: 99%
“…TTD-MDPs are a statistical machine learning formalism that leverage probabilistic reasoning to create a "policy" specifying the relative probability of short term goals in relation to the authors overall criteria. They have been applied to the goal selection problem in DODM (Roberts et al 2006;Bhat et al 2007;Roberts et al 2007;Roberts 2010).…”
Section: Representation Of Authors' Criteriamentioning
confidence: 99%
“…Techniques have been developed for tightly coupling plot creation and character behavior in dialogue-oriented interactive stories [13], for monitoring users' actions to determine if they are threatening the plot and, if so, either accommodating the new development or intervening [11] [18]. Search-based approaches utilize an evaluation function encoding author aesthetics to guide narrative planning [15] [22], while reinforcement learning has been used to learn policies to guide narrative action selection [16], and Targeted Trajectory Distribution Markov Decision Processes (TTD-MDP) have been employed to enhance replayability [4] [19]. Other narrative planners have taken decision-theoretic approaches to narrative management [14][21].…”
Section: Introductionmentioning
confidence: 99%