2009
DOI: 10.1016/j.eswa.2007.09.050
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic packaging in e-retailing with stochastic demand over finite horizons: A Q-learning approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 19 publications
(21 reference statements)
0
1
0
Order By: Relevance
“…They considered an infinite horizon learning problem where there is no deadline for the sale of stock. Cheng [28] applied a QL algorithm for RL to solve dynamic pricing problems for selling a given stock with a finite horizon. The study investigated the pricing process and how an RL framework is used to set prices dynamically to adapt to uncertain demand and large-scale states.…”
Section: B Reinforcement Learning For Dynamic Pricingmentioning
confidence: 99%
“…They considered an infinite horizon learning problem where there is no deadline for the sale of stock. Cheng [28] applied a QL algorithm for RL to solve dynamic pricing problems for selling a given stock with a finite horizon. The study investigated the pricing process and how an RL framework is used to set prices dynamically to adapt to uncertain demand and large-scale states.…”
Section: B Reinforcement Learning For Dynamic Pricingmentioning
confidence: 99%
“…And in some examples of book [7], tabular-value function is used and excellent results were achieved. However, the size of the table may be considerable because of the excessive amount of memory needed to store the table [8]. In order to deal with the continuous state space, some approximation methods are taken.…”
Section: Introductionmentioning
confidence: 99%
“…Reinforcement learning offers the advantage of formulation of a mathematical model based on multiple variables without any pre-definition of non-linear structure of the model, (Jiang andSheng, 2009, Dorca et al, 2013). Applications of reinforcement learning in the context of expert systems include, among others, goal-regulation in manufacturing systems (Shin et al, 2012), real time rescheduling (Palombarini and Martinez, 2012), inventory control in supply chain management (Kwon et al, 2008;Jiang and Sheng, 2009), and real-time dynamic packaging for e-commerce (Cheng, 2009). Our research similarly uses the advantages of using a model-free approach offered by reinforcement learning algorithm but is applied in a different domain i.e, the dynamic pricing of multiple interdependent products.…”
Section: Introductionmentioning
confidence: 99%