2018
DOI: 10.1177/0278364918776060

Shared autonomy via hindsight optimization for teleoperation and teaming

Abstract: In shared autonomy, a user and autonomous system work together to achieve shared goals. To collaborate effectively, the autonomous system must know the user's goal. As such, most prior works follow a predict-then-act model, first predicting the user's goal with high confidence, then assisting given that goal. Unfortunately, confidently predicting the user's goal may not be possible until they have nearly achieved it, causing predict-then-act methods to provide little assistance. However, the system can often p…
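To make the abstract's contrast concrete, here is a minimal sketch (not the paper's implementation) of the two strategies, assuming a discrete set of candidate goals and a posterior belief over them; the function names and the 0.9 confidence threshold are illustrative assumptions.

```python
import numpy as np

def predict_then_act(belief, goals, robot_pos, threshold=0.9):
    """Assist only after one goal is predicted with high confidence."""
    i = int(np.argmax(belief))
    if belief[i] < threshold:
        return np.zeros_like(robot_pos)  # no confident prediction: no assistance
    return goals[i] - robot_pos          # move straight toward the predicted goal

def assist_over_distribution(belief, goals, robot_pos):
    """Act on the whole goal distribution: blend the goal-directed motions
    by their posterior probability, so assistance starts immediately."""
    directions = np.array([g - robot_pos for g in goals])
    return belief @ directions           # belief-weighted assistance direction
```

With belief = np.array([0.4, 0.35, 0.25]), predict_then_act provides no motion at all, while assist_over_distribution already nudges the robot toward the region the likely goals share.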

Cited by 165 publications (166 citation statements)
References 76 publications
“…As mentioned in the introduction, a key motivation behind learning agent models is to enable effective human-machine interaction. The algorithms presented in this work can be used both by a collaborative robot (for learning models of human behavior to improve robot decision-making) and by a human (to learn transparent models of robot behavior to better calibrate trust) (Javdani et al. 2018; Yang et al. 2017). Both of these use cases offer novel research avenues, such as the efficient specification of decision-theoretic models for interaction (such as POMDPs and decentralized POMDPs) and evaluation of the utility of aligned models as compared to purely predictive models for human-machine interaction (Oliehoek, Amato, et al. 2016).…”
Section: Discussion (mentioning)
Confidence: 99%
“…Memory-based inference has been used in prior work on shared autonomy and human-robot systems [8,9,30]. Another approach uses Laplace's approximation [9] and formulates the problem as optimizing a partially observable Markov decision process (POMDP) over the user's goal, arbitrating control over a distribution of possible outcomes [20]. That approach conditions its prediction model on user inputs and uses a hand-specified, distance-based user cost function that admits a closed-form value-function computation.…”
Section: Related Work (mentioning)
Confidence: 99%
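The closed-form value computation mentioned in the statement above can be illustrated with a small sketch. Assuming a cost proportional to distance traveled (the cited papers' exact cost functions may differ), the cost-to-go requires no value iteration, and arbitration over the goal distribution becomes a one-line expectation; arbitrate and the other names here are illustrative.

```python
import numpy as np

def value(x, g):
    # With a cost of one per unit distance traveled, the optimal cost-to-go
    # from x to goal g is simply the Euclidean distance; no dynamic
    # programming is needed.
    return -np.linalg.norm(x - g)

def q_value(x, u, g, dt=0.1):
    # One-step lookahead: a small time cost plus the value at the next state.
    return -dt + value(x + dt * u, g)

def arbitrate(x, candidate_actions, belief, goals):
    # QMDP-style arbitration over the goal distribution: choose the action
    # with the highest expected Q-value under the current goal belief.
    return max(candidate_actions,
               key=lambda u: sum(b * q_value(x, u, g)
                                 for b, g in zip(belief, goals)))
```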
“…In addition to the distance-based observation likelihood used in prior works, our approach fuses multiple observations and introduces probabilistic modeling of user control inputs as goal-directed actions, customizing the rationality-index value to each individual user and thus accounting for their particular behavior. Our approach is motivated by prior studies showing that users vary greatly in their performance, preferences, and desires [9,20,37], suggesting a need for assistive systems to adapt to individual users.…”
Section: Related Work (mentioning)
Confidence: 99%
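A minimal sketch of customizing a rationality index to an individual user, in the spirit of the statement above: the user is modeled as Boltzmann-rational, and the index beta is fit by maximum likelihood over that user's observed actions. The grid-search fit and the (q_values, chosen_action_index) demonstration format are illustrative assumptions.

```python
import numpy as np

def boltzmann(beta, q_values):
    """Noisily optimal user model: P(u | x, g) is proportional to
    exp(beta * Q(x, u, g)); higher beta means a more deterministic user."""
    z = np.exp(beta * (q_values - q_values.max()))  # shift max for stability
    return z / z.sum()

def fit_rationality(demos, betas=np.linspace(0.1, 20.0, 200)):
    """Maximum-likelihood rationality index for one user, where demos is a
    list of (q_values, chosen_action_index) pairs observed from that user."""
    def log_likelihood(beta):
        return sum(np.log(boltzmann(beta, q)[a]) for q, a in demos)
    return max(betas, key=log_likelihood)
```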
“…Intent can also be inferred from the user's control signals and other environmental cues using various algorithms [11]. Within the context of shared autonomy, a Bayesian scheme for user intent prediction models the user within the Markov decision process framework [12], [13], [14]; the user is typically assumed to be noisily optimizing some cost function for their intended goal. In low-dimensional spaces, this cost function can be learned from expert demonstrations using inverse reinforcement learning [15].…”
Section: Related Work (mentioning)
Confidence: 99%
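The Bayesian scheme described above can be sketched as a recursive belief update in which the user is modeled as noisily optimizing a cost for their intended goal. The distance-based Q, the four-action set, and beta = 5 are illustrative assumptions, not the cited papers' exact models.

```python
import numpy as np

ACTIONS = [np.array(a, dtype=float) for a in ([1, 0], [-1, 0], [0, 1], [0, -1])]

def q(x, u, g, dt=0.1):
    # Assumed distance-based Q: value of taking a small step u toward goal g.
    return -np.linalg.norm((x + dt * u) - g)

def likelihood(u_idx, x, g, beta=5.0):
    # Boltzmann (noisily optimal) user model over the discrete action set.
    scores = np.array([beta * q(x, u, g) for u in ACTIONS])
    p = np.exp(scores - scores.max())
    return (p / p.sum())[u_idx]

def bayes_update(belief, u_idx, x, goals):
    # One recursive Bayes step: posterior over goals = likelihood * prior,
    # renormalized, applied after each observed user control input.
    post = belief * np.array([likelihood(u_idx, x, g) for g in goals])
    return post / post.sum()
```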