Neville Mehta scite author profile

Transfer learning seeks to leverage previously learned tasks to achieve faster learning in a new task. In this paper, we consider transfer learning in the context of related but distinct Reinforcement Learning (RL) problems. In particular, our RL problems are derived from Semi-Markov Decision Processes (SMDPs) that share the same transition dynamics but have different reward functions that are linear in a set of reward features. We formally define the transfer learning problem in the context of RL as learning an efficient algorithm to solve any SMDP drawn from a fixed distribution after experiencing a finite number of them. Furthermore, we introduce an online algorithm to solve this problem, Variable-Reward Reinforcement Learning (VRRL), that compactly stores the optimal value functions for several SMDPs, and uses them to optimally initialize the value function for a new SMDP. We generalize our method to a hierarchical RL setting where the different SMDPs share the same task hierarchy. Our experimental results in a simplified real-time strategy domain show that significant transfer learning occurs in both flat and hierarchical settings. Transfer is especially effective in the hierarchical setting where the overall value functions are decomposed into subtask value functions which are more widely amenable to transfer across different SMDPs.

show abstract

Automatic discovery and transfer of MAXQ hierarchies

Mehta

Ray

Tadepalli

et al. 2008

View full text Add to dashboard Cite

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful trajectory from a source reinforcement learning task. HI-MAT discovers subtasks by analyzing the causal and temporal relationships among the actions in the trajectory. Under appropriate assumptions, HI-MAT induces hierarchies that are consistent with the observed trajectory and have compact value-function tables employing safe state abstractions. We demonstrate empirically that HI-MAT constructs compact hierarchies that are comparable to manuallyengineered hierarchies and facilitate significant speedup in learning when transferred to a target task.

show abstract

Content based sub-image retrieval system for high resolution pathology images using salient interest points

Mehta

Alomari

Chaudhary

2009

View full text Add to dashboard Cite

Content-based image retrieval systems for digital pathology require sub-image retrieval rather than the whole image retrieval for the system to be of clinical use. Digital pathology images are huge in size and thus the pathologist is interested in retrieving specific structures from the whole images in the database along with the previous diagnosis of the retrieved sub-image. We propose a content-based sub-image retrieval system (sCBIR) framework for high resolution digital pathology images. We utilize scale-invariant feature extraction and present an efficient and robust searching mechanism for indexing the images as well as for query execution of sub-image retrieval. We present a working sCBIR system and show results of testing our system on a set of queries for specific structures of interest for pathologists in clinical use. The outcomes of the sCBIR system are compared to manual search and there is an 80% match in the top five searches.

show abstract

Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

et al. 2011

View full text Add to dashboard Cite

Sequential decision tasks present many opportunities for the study of transfer learning. A principal one among them is the existence of multiple domains that share the same underlying causal structure for actions. We describe an approach that exploits this shared causal structure to discover a hierarchical task structure in a source domain, which in turn speeds up learning of task execution knowledge in a new target domain. Our approach is theoretically justiﬁed and compares favorably to manually designed task hierarchies in learning efﬁciency in the target domain. We demonstrate that causally motivated task hierarchies transfer more robustly than other kinds of detailed knowledge that depend on the idiosyncrasies of the source domain and are hence less transferable.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Neville Mehta

Transfer in variable-reward hierarchical reinforcement learning

Automatic discovery and transfer of MAXQ hierarchies

Content based sub-image retrieval system for high resolution pathology images using salient interest points

Automatic Discovery and Transfer of Task Hierarchies in Reinforcement Learning

Contact Info

Product

Resources

About