제목 | MuZero 강화학습을 이용한 항만 컨테이너 재정돈 계획 |
---|---|
Title | Container Pre-Marshalling with MuZero Reinforcement Learning Algorithm |
저자 |
최원균 (한국항공대학교 항공교통물류학과) 이기주 (한국항공대학교 항공교통물류학과) 채준재* (한국항공대학교 항공교통물류학과) |
Author |
Wongyun Choi(School of Air Transport, Transportation and Logistics, Korea Aerospace University) Keyju Lee(School of Air Transport, Transportation and Logistics, Korea Aerospace University) Junjae Chae†(School of Air Transport, Transportation and Logistics, Korea Aerospace University) |
Bibliography | Journal of Logistics Science & Technology, 2(2),19~37, 2021, |
DOI | |
Key Words | Pre-Marshalling, Container Reshuffling, Reinforcement Learning, MuZero |
Abstract | This study provides a new solution approach for container pre-marshalling problems using a reinforcement learning method, the MuZero Algorithm. We have developed a customized pre-marshalling environment for an agent to be trained. To facilitate the training, we have devised i) some action masking methods, and ii) a reward function where lower bound for the number of rehandling is considered. Action masking methods and heuristically found lower bound were proven to be helpful in the learning process. Experiments with different sizes (from the minimum number of 8 and the maximum number of 12 containers) were carried out, within a stacking space of six rows and six tiers. Experimental results show that the MuZero algorithm implemented with our strategy is capable of training to solve the pre-marshalling problems for some small sized problems. |
PDF download | JM_2[2]-P19~37.pdf |