Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy

Duminy, Nicolas; Nguyen, Sao Mai; Zhu, Junshuai; Duhaut, Dominique; Kerdreux, Jérôme

doi:10.3390/app11030975

Cited by 8 publications

(8 citation statements)

References 48 publications

(59 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Despite its simplicity, and to some extent because of it, this scenario allows us to focus on the main functions of the proposed system and analyse their contribution to the learning process. Furthermore, this type of task is typical of the IMOL literature [44], [58], [25], as well as our previous work on GRAIL system [37], [53]: this allows us to both place this study in continuity with previous ones, and facilitate a comparison with other systems by highlighting advances and differences.…”

Section: A Environment and Taskmentioning

confidence: 94%

“…The concept of Intrinsic Motivations (IMs) is borrowed from the biological [9] and psychological literature [10] describing how novel or unexpected "neutral" stimuli, as well as the perception of control, can drive learning processes in the absence of rewards or assigned goals. In the computational field, IMs have been implemented to foster different autonomous processes such as state-space exploration [11], [12], [13], knowledge gathering [14], [15], learning repertoire of skills [16], [17], [18], affordance exploitation [19], [20], goal selection [21], [22], [23], and also boosting imitation learning techniques [24], [25].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

Santucci

Montella

Baldassarre

2023

IEEE Trans. Cogn. Dev. Syst.

View full text Add to dashboard Cite

When facing the problem of autonomously learning to achieve multiple goals, researchers typically focus on problems where each goal can be solved using just one policy. However, in environments presenting different contexts, the same goal might need different skills to be solved. These situations pose two challenges: (a) recognise which are the contexts that need different policies to perform the goals; (b) learn the policies to accomplish the same goal in the identified relevant contexts. These two challenges are even harder if faced within an open-ended learning framework where potentially an agent has no information on the environment, possibly not even about the goals it can pursue. We propose a novel robotic architecture, Contextual GRAIL (C-GRAIL), that solves these challenges in an integrated fashion. The architecture is able to autonomously detect new relevant contexts and ignore irrelevant ones, on the basis of the decrease of the expected performance for a given goal. Moreover, C-GRAIL can quickly learn the policies for new contexts leveraging on transfer learning techniques. The architecture is tested in a simulated robotic environment involving a robot that autonomously discovers and learns to reach relevant target objects in the presence of multiple obstacles generating several different contexts.

show abstract

Section: A Environment and Taskmentioning

confidence: 94%

Section: Introductionmentioning

confidence: 99%

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

Santucci

Montella

Baldassarre

2023

IEEE Trans. Cogn. Dev. Syst.

View full text Add to dashboard Cite

show abstract

“…Such algorithms have been developed under the names of interactive reinforcement learning or active imitation learning in robotics. In Reference [85], they allowed the system to learn micro and compound actions, while minimizing the number of requests for labeled data by choosing when, what information to ask, and to whom to ask for help. Such principles could inspire a smart home system to continue to adapt its model, while minimizing user intervention and optimizing his intervention, by pointing out the missing key information.…”

Section: Temporal Driftmentioning

confidence: 99%

A Survey of Human Activity Recognition in Smart Homes Based on IoT Sensors Algorithms: Taxonomies, Challenges, and Opportunities with Deep Learning

Bouchabou

Nguyen

Lohr

et al. 2021

Sensors

Self Cite

108

View full text Add to dashboard Cite

Recent advances in Internet of Things (IoT) technologies and the reduction in the cost of sensors have encouraged the development of smart environments, such as smart homes. Smart homes can offer home assistance services to improve the quality of life, autonomy, and health of their residents, especially for the elderly and dependent. To provide such services, a smart home must be able to understand the daily activities of its residents. Techniques for recognizing human activity in smart homes are advancing daily. However, new challenges are emerging every day. In this paper, we present recent algorithms, works, challenges, and taxonomy of the field of human activity recognition in a smart home through ambient sensors. Moreover, since activity recognition in smart homes is a young field, we raise specific problems, as well as missing and needed contributions. However, we also propose directions, research opportunities, and solutions to accelerate advances in this field.

show abstract

“…It usually transforms and adjusts the parameters of the network model in the source domain and applies it to the target domain. [41][42][43][44][45] Because there are few time-series samples in the target domain, the real-time data come in batches. It is a very challenging task to retrain the model, and the target data prediction task cannot be completed.…”

Section: Transfer Learning Of Base Modelsmentioning

confidence: 99%

“…Transfer learning 37–40 is the process that uses the knowledge learned from the source domain to deal with the problems in the target domain. It usually transforms and adjusts the parameters of the network model in the source domain and applies it to the target domain 41–45 …”

Section: Base Model Transfermentioning

confidence: 99%

Multihorizons transfer strategy for continuous online prediction of time‐series data in complex systems

Zhou

Wang

2022

Int J of Intelligent Sys

View full text Add to dashboard Cite

The sustainability online prediction is of great significance for higher horizon time-series prediction in the future, and it embodies higher application value in equipment fault prediction and health management. However, compared with one-step time-series prediction, continuous online prediction faces many uncertainties, including error accumulation and lack of information. To realize continuous online prediction of time-series data in complex systems, this paper proposes a continuous online prediction strategy based on multihorizons transfer (OnMultiHorTS), which is used for continuous online prediction tasks of timeseries data. The algorithm aims to use source domain data to provide more effective information for target prediction tasks. However, the time-varying characteristics of time-series data often lead to large differences in data distribution over a long time span, which is difficult to guarantee the assumption that the data are the same distribution. How to construct more effective source domain information based on historical data and existing data, and apply it to the target domain prediction tasks, is one of the focuses of our OnMultiHorTS algorithm. In addition, different from the typical iterative and multistep advance prediction methods, the proposed algorithm regards different prediction tasks as different horizons, which are

show abstract

Intrinsically Motivated Open-Ended Multi-Task Learning Using Transfer Learning to Discover Task Hierarchy

Cited by 8 publications

References 48 publications

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

C-GRAIL: Autonomous Reinforcement Learning of Multiple and Context-Dependent Goals

A Survey of Human Activity Recognition in Smart Homes Based on IoT Sensors Algorithms: Taxonomies, Challenges, and Opportunities with Deep Learning

Multihorizons transfer strategy for continuous online prediction of time‐series data in complex systems

Contact Info

Product

Resources

About