2004
DOI: 10.1016/j.ins.2003.03.005
|View full text |Cite
|
Sign up to set email alerts
|

A generic architecture for adaptive agents based on reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
9
0

Year Published

2007
2007
2017
2017

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 18 publications
0
9
0
Order By: Relevance
“…A typical mathematical model of a cart-pole system can be represented as Eqs. (12) and (13). Also, the cart-pole model by a personal computer is simulated using the fourth order Runge-Kutta [3].…”
Section: Cart-pole Simulationmentioning
confidence: 99%
See 1 more Smart Citation
“…A typical mathematical model of a cart-pole system can be represented as Eqs. (12) and (13). Also, the cart-pole model by a personal computer is simulated using the fourth order Runge-Kutta [3].…”
Section: Cart-pole Simulationmentioning
confidence: 99%
“…By taking action 'a' based on state 's', the algorithm evaluates the feedback rewards and updates its Q values from the rewards sequentially. In an acceptable learning period, the Q-learning algorithm performs excellently, as evidenced by its successful application to collision-avoidance, homing, and robot cooperation [4,6,7,9,13].…”
Section: Introductionmentioning
confidence: 99%
“…The theory of multi-agent system and distributed artificial intelligence (DAI) 0020 provides feasible technical support to modeling and realization of Intelligent Manufacturing System, which becomes one of the research hotspots in the manufacturing domain [27]. Manufacturing process is a typical multi-agent question solution process, and every department (or segment) in manufacturing system is equal to an Agent in the process.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning methods have been widely applied to a variety of problems, such as autonomous robot navigation and nonlinear control [7,22]. However, most conventional reinforcement learning methods use a discrete state space to describe the interaction between the agent and its environment.…”
Section: Introductionmentioning
confidence: 99%