论文标题

通过带宽分配模型的强化学习代理设计和优化

Reinforcement Learning Agent Design and Optimization with Bandwidth Allocation Model

论文作者

Reale, Rafael F., Martins, Joberto S. B.

论文摘要

强化学习(RL)目前用于各种现实生活中。基于RL的解决方案有潜力解决问题,包括难以解决启发式方法和元数据的问题,此外,还需要一些智能或认知方法的问题和问题。但是,加固学习师需要设计不直接的设计,并且存在重要的设计问题。 RL代理设计问题包括目标问题建模,州空间爆炸,训练过程和代理效率。目前的研究解决了旨在促进RL传播的问题。总而言之,BAM模型与用户分配并共享资源。有三种基本的BAM模型和几种混合动力车在用户之间分配和共享资源方面有所不同。本文解决了RL代理设计和效率的问题。 RL代理的目标是在用户之间分配和共享资源。该论文研究了BAM模型如何有助于RL代理设计和效率。分析和模拟了AlloCTC共享(ATC)模型,以评估其模拟RL代理操作以及ATC如何从RL代理中卸载计算任务。研究的基本论点是,与RL代理设计和操作集成的算法是否有可能促进代理设计并优化其执行。提出的ATC分析模型和仿真表明,BAM模型卸载代理任务并有助于代理的设计和优化。

Reinforcement learning (RL) is currently used in various real-life applications. RL-based solutions have the potential to generically address problems, including the ones that are difficult to solve with heuristics and meta-heuristics and, in addition, the set of problems and issues where some intelligent or cognitive approach is required. However, reinforcement learning agents require a not straightforward design and have important design issues. RL agent design issues include the target problem modeling, state-space explosion, the training process, and agent efficiency. Research currently addresses these issues aiming to foster RL dissemination. A BAM model, in summary, allocates and shares resources with users. There are three basic BAM models and several hybrids that differ in how they allocate and share resources among users. This paper addresses the issue of an RL agent design and efficiency. The RL agent's objective is to allocate and share resources among users. The paper investigates how a BAM model can contribute to the RL agent design and efficiency. The AllocTC-Sharing (ATCS) model is analytically described and simulated to evaluate how it mimics the RL agent operation and how the ATCS can offload computational tasks from the RL agent. The essential argument researched is whether algorithms integrated with the RL agent design and operation have the potential to facilitate agent design and optimize its execution. The ATCS analytical model and simulation presented demonstrate that a BAM model offloads agent tasks and assists the agent's design and optimization.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源