论文标题
部分可观测时空混沌系统的无模型预测
Pretraining in Deep Reinforcement Learning: A Survey
论文作者
论文摘要
在过去的几年中,将加强学习(RL)与深度学习相结合。从游戏到机器人技术的各种突破都引起了人们对设计复杂的RL算法和系统的兴趣。但是,RL的主要工作流程是学习Tabula Rasa,这可能会导致计算效率低下。这排除了RL算法的连续部署,并有可能排除没有大规模计算资源的研究人员。在机器学习的许多其他领域,预处理的范式已证明可以有效地获取可转移的知识,可以将其用于各种下游任务。最近,我们看到对深入RL的预处理的兴趣激增,并有令人鼓舞的结果。但是,许多研究都是基于不同的实验环境。由于RL的性质,该领域的预处理面临着独特的挑战,因此需要新的设计原则。在这项调查中,我们试图系统地检查预处理研究的现有作品,提供这些方法的分类法,讨论每个子场,并引起人们对开放问题和未来方向的关注。
The past few years have seen rapid progress in combining reinforcement learning (RL) with deep learning. Various breakthroughs ranging from games to robotics have spurred the interest in designing sophisticated RL algorithms and systems. However, the prevailing workflow in RL is to learn tabula rasa, which may incur computational inefficiency. This precludes continuous deployment of RL algorithms and potentially excludes researchers without large-scale computing resources. In many other areas of machine learning, the pretraining paradigm has shown to be effective in acquiring transferable knowledge, which can be utilized for a variety of downstream tasks. Recently, we saw a surge of interest in Pretraining for Deep RL with promising results. However, much of the research has been based on different experimental settings. Due to the nature of RL, pretraining in this field is faced with unique challenges and hence requires new design principles. In this survey, we seek to systematically review existing works in pretraining for deep reinforcement learning, provide a taxonomy of these methods, discuss each sub-field, and bring attention to open problems and future directions.