论文标题
循环图上的贝叶斯颗粒
Bayesian Particles on Cyclic Graphs
论文作者
论文摘要
我们考虑设计合成细胞以实现复杂目标(例如,通过寻求入侵者模仿免疫系统)的问题(例如,循环系统),在那里他们可能必须改变自己的控制政策,相互交流,并与包括虚假的积极性和否定性相互处理,包括最少的误差和否定,仅与最少的能力和几乎差异。 我们使用循环,迷宫样环境模拟免疫反应,并在未知位置使用目标来表示入侵的细胞。仅使用几个存储器,对合成细胞进行编程,以执行强化学习类型算法,它们基于与其他单元格的随机相遇来更新其控制策略。随着合成单元格一起找到目标,它们作为集合功能作为贝叶斯更新的物理实现。也就是说,颗粒充当粒子滤波器。 该结果提供了有关合成细胞集合的行为的正式特性,可用于确保鲁棒性和安全性。这种简化的增强学习方法在模拟中评估,并应用于人类循环系统的实际模型。
We consider the problem of designing synthetic cells to achieve a complex goal (e.g., mimicking the immune system by seeking invaders) in a complex environment (e.g., the circulatory system), where they might have to change their control policy, communicate with each other, and deal with stochasticity including false positives and negatives---all with minimal capabilities and only a few bits of memory. We simulate the immune response using cyclic, maze-like environments and use targets at unknown locations to represent invading cells. Using only a few bits of memory, the synthetic cells are programmed to perform a reinforcement learning-type algorithm with which they update their control policy based on randomized encounters with other cells. As the synthetic cells work together to find the target, their interactions as an ensemble function as a physical implementation of a Bayesian update. That is, the particles act as a particle filter. This result provides formal properties about the behavior of the synthetic cell ensemble that can be used to ensure robustness and safety. This method of simplified reinforcement learning is evaluated in simulations, and applied to an actual model of the human circulatory system.