论文标题

SARSA(0)对完全同态加密的增强学习

SARSA(0) Reinforcement Learning over Fully Homomorphic Encryption

论文作者

Suh, Jihoon, Tanaka, Takashi

论文摘要

我们考虑了基于云的控制体系结构,其中本地工厂将控制合成任务外包给云。特别是,我们考虑了基于云的增强学习(RL),其中更新值函数被外包到云中。为了实现机密性,我们对完全同构加密(FHE)实施计算。我们使用CKKS加密方案和修改后的SARSA(0)强化学习来合并加密引起的延迟。然后,我们通过阻止机制给出了SARSA(0)的延迟更新规则的收敛结果。我们最终通过实施经典的钢管平衡问题来提出数值演示。

We consider a cloud-based control architecture in which the local plants outsource the control synthesis task to the cloud. In particular, we consider a cloud-based reinforcement learning (RL), where updating the value function is outsourced to the cloud. To achieve confidentiality, we implement computations over Fully Homomorphic Encryption (FHE). We use a CKKS encryption scheme and a modified SARSA(0) reinforcement learning to incorporate the encryption-induced delays. We then give a convergence result for the delayed updated rule of SARSA(0) with a blocking mechanism. We finally present a numerical demonstration via implementing on a classical pole-balancing problem.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源