Er Meng Joo scite author profile

Purpose -This purpose of this paper is to provide an overview of the theoretical background and applications of inverse reinforcement learning (IRL). Design/methodology/approach -Reinforcement learning (RL) techniques provide a powerful solution for sequential decision making problems under uncertainty. RL uses an agent equipped with a reward function to find a policy through interactions with a dynamic environment. However, one major assumption of existing RL algorithms is that reward function, the most succinct representation of the designer's intention, needs to be provided beforehand. In practice, the reward function can be very hard to specify and exhaustive to tune for large and complex problems, and this inspires the development of IRL, an extension of RL, which directly tackles this problem by learning the reward function through expert demonstrations. In this paper, the original IRL algorithms and its close variants, as well as their recent advances are reviewed and compared. Findings -This paper can serve as an introduction guide of fundamental theory and developments, as well as the applications of IRL. Originality/value -This paper surveys the theories and applications of IRL, which is the latest development of RL and has not been done so far.

show abstract

H∞ state-feedback control of time-delay systems using reciprocally convex approach

Vembarasan

Balasubramaniam

Joo

2014

Journal of Process Control

View full text Add to dashboard Cite

A review of inverse reinforcement learning theory and recent advances

Shao

Joo

2012

View full text Add to dashboard Cite

A Novel Fuzzy Logic Model for Pseudo-Relevance Feedback-Based Query Expansion

Singh

Prasad

et al. 2016

Int. J. Fuzzy Syst.

View full text Add to dashboard Cite

Design for auto-tuning PID controller based on genetic algorithms

Liu

Joo

2009

View full text Add to dashboard Cite

A survey on artificial intelligence technologies in modeling of High Speed end-milling processes

Torabi

Joo

et al. 2009

View full text Add to dashboard Cite

High Speed Machining centers (HSM) are considered as complicated industrial instruments. Finishing is a critical process in production procedure which is carried on by these machines. Among many types of cutters, ball-nose cutters are the preferred cutters to do these kinds of operations since they have extensive operating cutting edges and appropriate geometry. The main aim of the researches on cutting process is to understand its nature better and to use this knowledge to improve the quality of the product. To achieve this goal, it is necessary to have a descriptive reference model on the process using experiments' data. Increasing demands for better surfacefinishing and concurrently the development of the available measurement instruments and modeling techniques make the methods and approaches to be novel. Present paper is a survey on the lack of literature on the state-of-the-art modeling paradigms of milling processes, mainly on ball-nose cutters for surface finishing.

show abstract

Parallel Region Coverage Using Multiple UAVs

Agarwal

Hiot

Nghia

et al.

View full text Add to dashboard Cite

Extended Approach of ANFIS in Cascade Control

Pratama¹,

Rajab²,

Joo³

2011

IJCEE

View full text Add to dashboard Cite

Abstract-this paper discusses extended approaches of Adaptive Network Based Fuzzy Inference System (ANFIS) using Self Tuning Regulator (STR) via Fuzzy logic to be implemented in cascade control. Plant used is Pressure RIG 38-714 which supports cascade configuration. The controlled variable in the outer loop is pressure, in the inner loop's flow. The proposed method's used to improve the performance of ANFIS which has been commonly implemented to accomplish control task. Comparison has been conducted between ANFIS using STR and without STR, from experiment could be concluded that controller ANFIS using STR acquired better performance than only used ANFIS. It's derived from the transient response of those. For ANFIS using STR is obtained rise time and settling time are 9 ms and 12 ms respectively. In the other hand, ANFIS without STR resulted 21 ms and 29 ms for rise time and settling time respectively.

show abstract

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Er Meng Joo

A survey of inverse reinforcement learning techniques

H∞ state-feedback control of time-delay systems using reciprocally convex approach

A review of inverse reinforcement learning theory and recent advances

A Novel Fuzzy Logic Model for Pseudo-Relevance Feedback-Based Query Expansion

Design for auto-tuning PID controller based on genetic algorithms

A survey on artificial intelligence technologies in modeling of High Speed end-milling processes

Parallel Region Coverage Using Multiple UAVs

Extended Approach of ANFIS in Cascade Control

Contact Info

Product

Resources

About