论文标题
OR-GYM:用于操作研究问题的增强学习库
OR-Gym: A Reinforcement Learning Library for Operations Research Problems
论文作者
论文摘要
强化学习(RL)已被广泛应用于游戏玩法,并超过了许多领域的人类水平的最佳性能,但是在工业或商业环境中,几乎没有用例。我们介绍了Or-Gym,这是一个开源库,用于开发加强学习算法来解决运营研究问题。在本文中,我们将加强学习应用于背包,多维垃圾箱包装,多回波供应链和多期资产分配模型问题,以及针对MILP和启发式模型的RL解决方案。这些问题用于物流,财务,工程,并且在许多业务运营设置中很常见。我们基于文献中的原型模型开发环境,并实施各种优化和启发式模型,以基于RL结果。通过将一系列经典优化问题重新列为RL任务,我们试图为运营研究社区提供新的工具,同时还将RL社区中的这些工具开放给RL社区中的这些工具,以应对或领域中的许多问题和挑战。
Reinforcement learning (RL) has been widely applied to game-playing and surpassed the best human-level performance in many domains, yet there are few use-cases in industrial or commercial settings. We introduce OR-Gym, an open-source library for developing reinforcement learning algorithms to address operations research problems. In this paper, we apply reinforcement learning to the knapsack, multi-dimensional bin packing, multi-echelon supply chain, and multi-period asset allocation model problems, as well as benchmark the RL solutions against MILP and heuristic models. These problems are used in logistics, finance, engineering, and are common in many business operation settings. We develop environments based on prototypical models in the literature and implement various optimization and heuristic models in order to benchmark the RL results. By re-framing a series of classic optimization problems as RL tasks, we seek to provide a new tool for the operations research community, while also opening those in the RL community to many of the problems and challenges in the OR field.