Dhruv Malik scite author profile

Dhruv Malik

5Publications

45Citation Statements Received

32Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of California, Berkeley, Jamia Millia Islamia

Publications

Order By: Most citations

Pragmatic-Pedagogic Value Alignment

Fisac

Gates

Hamrick

et al. 2019

View full text Add to dashboard Cite

As intelligent systems gain autonomy and capability, it becomes vital to ensure that their objectives match those of their human users; this is known as the value-alignment problem. In robotics, value alignment is key to the design of collaborative robots that can integrate into human workflows, successfully inferring and adapting to their users' objectives as they go. We argue that a meaningful solution to value alignment must combine multi-agent decision theory with rich mathematical models of human cognition, enabling robots to tap into people's natural collaborative capabilities. We present a solution to the cooperative inverse reinforcement learning (CIRL) dynamic game based on well-established cognitive models of decision making and theory of mind. The solution captures a key reciprocity relation: the human will not plan her actions in isolation, but rather reason pedagogically about how the robot might learn from them; the robot, in turn, can anticipate this and interpret the human's actions pragmatically. To our knowledge, this work constitutes the first formal analysis of value alignment grounded in empirically validated cognitive models.

show abstract

Pragmatic-Pedagogic Value Alignment

Fisac¹,

Gates²,

Hamrick³

et al. 2017

Preprint

View full text Add to dashboard Cite

An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning

Malik¹,

Palaniappan²,

Fisac³

et al. 2018

Preprint

View full text Add to dashboard Cite

Our goal is for AI systems to correctly identify and act according to their human user's objectives. Cooperative Inverse Reinforcement Learning (CIRL) formalizes this value alignment problem as a two-player game between a human and robot, in which only the human knows the parameters of the reward function: the robot needs to learn them as the interaction unfolds. Previous work showed that CIRL can be solved as a POMDP, but with an action space size exponential in the size of the reward parameter space. In this work, we exploit a specific property of CIRL-the human is a full information agent-to derive an optimality-preserving modification to the standard Bellman update; this reduces the complexity of the problem by an exponential factor and allows us to relax CIRL's assumption of human rationality. We apply this update to a variety of POMDP solvers and find that it enables us to scale CIRL to non-trivial problems, with larger reward parameter spaces, and larger action spaces for both robot and human. In solutions to these larger problems, the human exhibits pedagogic (teaching) behavior, while the robot interprets it as such and attains higher value for the human.

show abstract

When Is Generalizable Reinforcement Learning Tractable?

Malik¹,

Li²,

Ravikumar³

2021

Preprint

View full text Add to dashboard Cite

Agents trained by reinforcement learning (RL) often fail to generalize beyond the environment they were trained in, even when presented with new scenarios that seem very similar to the training environment. We study the query complexity required to train RL agents that can generalize to multiple environments. Intuitively, tractable generalization is only possible when the environments are similar or close in some sense. To capture this, we introduce Strong Proximity, a structural condition which precisely characterizes the relative closeness of different environments. We provide an algorithm which exploits Strong Proximity to provably and efficiently generalize. We also show that under a natural weakening of this condition, which we call Weak Proximity, RL can require query complexity that is exponential in the horizon to generalize. A key consequence of our theory is that even when the environments share optimal trajectories, and have highly similar reward and transition functions (as measured by classical metrics), tractable generalization is impossible.

show abstract

Breaking an image encryption scheme based on chaotic synchronization phenomenon

Singh

Ahmad

Malik

2016

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dhruv Malik

Pragmatic-Pedagogic Value Alignment

Pragmatic-Pedagogic Value Alignment

An Efficient, Generalized Bellman Update For Cooperative Inverse Reinforcement Learning

When Is Generalizable Reinforcement Learning Tractable?

Breaking an image encryption scheme based on chaotic synchronization phenomenon

Contact Info

Product

Resources

About