Korea Society of Logistics Science and Technology (한국물류과학기술학회)


Title (Korean)	MuZero 강화학습을 이용한 항만 컨테이너 재정돈 계획
Title	Container Pre-Marshalling with MuZero Reinforcement Learning Algorithm
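Author (Korean)	최원균 (School of Air Transport, Transportation and Logistics, Korea Aerospace University), 이기주 (School of Air Transport, Transportation and Logistics, Korea Aerospace University), 채준재* (School of Air Transport, Transportation and Logistics, Korea Aerospace University)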
Author	Wongyun Choi (School of Air Transport, Transportation and Logistics, Korea Aerospace University), Keyju Lee (School of Air Transport, Transportation and Logistics, Korea Aerospace University), Junjae Chae† (School of Air Transport, Transportation and Logistics, Korea Aerospace University)
Bibliography	Journal of Logistics Science & Technology, 2(2), 19-37, 2021
DOI
Key Words	Pre-Marshalling, Container Reshuffling, Reinforcement Learning, MuZero
Abstract	This study presents a new solution approach to the container pre-marshalling problem based on a reinforcement learning method, the MuZero algorithm. We developed a customized pre-marshalling environment in which the agent is trained. To facilitate training, we devised i) several action masking methods and ii) a reward function that incorporates a lower bound on the number of rehandling moves. Both the action masking methods and the heuristically obtained lower bound proved helpful in the learning process. Experiments were carried out on instances of 8 to 12 containers within a stacking space of six rows and six tiers. The results show that the MuZero algorithm, implemented with our strategy, can be trained to solve small pre-marshalling instances.
PDF download	JM_2[2]-P19~37.pdf
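
The abstract names two training aids: action masking and a reward that uses a lower bound on rehandling moves. The sketch below illustrates both ideas on a toy bay and is not the authors' implementation. It assumes that each stack lists retrieval priorities from bottom to top, that a smaller number means earlier retrieval, and that the lower bound is the standard count of badly placed containers (the paper's heuristic bound may differ). The names `legal_move_mask`, `misplaced_lower_bound`, `apply_move`, and `shaped_reward` are hypothetical, and the reward formula is an illustrative assumption only.

```python
from typing import List

# Assumption: a bay is a list of stacks; each stack lists priorities bottom to top,
# and a smaller priority number means the container is retrieved earlier.
Bay = List[List[int]]


def legal_move_mask(bay: Bay, max_tiers: int) -> List[List[bool]]:
    """mask[src][dst] is True when the top container of src may be moved onto dst."""
    n = len(bay)
    return [
        [src != dst and len(bay[src]) > 0 and len(bay[dst]) < max_tiers
         for dst in range(n)]
        for src in range(n)
    ]


def misplaced_lower_bound(bay: Bay) -> int:
    """Count badly placed containers: each one must be moved at least once."""
    count = 0
    for stack in bay:
        well_ordered = True
        for below, above in zip(stack, stack[1:]):
            # A container blocks the one below it if it is retrieved later;
            # everything above a badly placed container is badly placed too.
            if above > below:
                well_ordered = False
            if not well_ordered:
                count += 1
    return count


def apply_move(bay: Bay, src: int, dst: int) -> Bay:
    """Return a new bay with the top container of src moved onto dst."""
    new_bay = [list(stack) for stack in bay]
    new_bay[dst].append(new_bay[src].pop())
    return new_bay


def shaped_reward(before: Bay, after: Bay) -> int:
    """Hypothetical reward: -1 per move, plus the reduction in the lower bound."""
    return (misplaced_lower_bound(before) - misplaced_lower_bound(after)) - 1


bay = [[2, 3, 1], [4], []]
mask = legal_move_mask(bay, max_tiers=3)
bound = misplaced_lower_bound(bay)
reward = shaped_reward(bay, apply_move(bay, 0, 2))
```

In this toy bay, stack 0 is full, so the mask forbids moving anything onto it, and the empty stack 2 cannot be a source. A masked policy would only sample among the remaining source and destination pairs.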
