2021
DOI: 10.1609/aaai.v35i1.16155
|View full text |Cite
|
Sign up to set email alerts
|

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

Abstract: We solve a challenging yet practically useful variant of 3D Bin Packing Problem (3D-BPP). In our problem, the agent has limited information about the items to be packed into a single bin, and an item must be packed immediately after its arrival without buffering or readjusting. The item's placement also subjects to the constraints of order dependence and physical stability. We formulate this online 3D-BPP as a constrained Markov decision process (CMDP). To solve the problem, we propose an effective and easy-to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
30
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 68 publications
(38 citation statements)
references
References 37 publications
0
30
0
Order By: Relevance
“…This work also discusses multiple constraints and criteria that should be considered in online BPPs. Zhao et al discretise the bottom of a bin as the grid, with grid points to be selected for locating the front-leftbottom corner of a box, and optimise space utilisation rate in 3D-BPP by an on-policy actor-critic algorithm [21,109]. This method is limited to integer constrained box sizes and employ an additional neural network to predict infeasible grid points for pruning solution space.…”
Section: On-line Bppsmentioning
confidence: 99%
See 3 more Smart Citations
“…This work also discusses multiple constraints and criteria that should be considered in online BPPs. Zhao et al discretise the bottom of a bin as the grid, with grid points to be selected for locating the front-leftbottom corner of a box, and optimise space utilisation rate in 3D-BPP by an on-policy actor-critic algorithm [21,109]. This method is limited to integer constrained box sizes and employ an additional neural network to predict infeasible grid points for pruning solution space.…”
Section: On-line Bppsmentioning
confidence: 99%
“…This method is limited to integer constrained box sizes and employ an additional neural network to predict infeasible grid points for pruning solution space. To handle continuous sizes of boxes, the authors also propose a tree search-based learning method, where leaf nodes represent candidate placements for the next item, and they use attention-based neural network to select a leaf node and spread the tree [18]. Yang et al reflect the promising candidate actions (derived from heuristics) as MDP rewards for the agent, so as to guide the policy learning via PPO [107].…”
Section: On-line Bppsmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to efficiently generate the optimal sequence and placement of objects, heuristic methods [1], [3], [4] with the greedy objective minimize the object stack heights in the packing boxes. Since the greedy search results in sub-optimal solution and high computational cost for object arrangement, data-driven methods [2], [5], [6] employ the reinforcement learning framework for bin packing. However, the objects for packing in realistic applications are usually irregular.…”
Section: Introductionmentioning
confidence: 99%