论文标题
循环中人类操作员的物联网的多步且有弹性的预测Q学习算法:供水网络中的案例研究
A Multi-step and Resilient Predictive Q-learning Algorithm for IoT with Human Operators in the Loop: A Case Study in Water Supply Networks
论文作者
论文摘要
我们考虑了在存在故障组件的情况下为物联网推荐弹性和预测性动作的问题,考虑到人类操作员的存在来操纵代理商为围栏目的所见环境的信息。物联网网络被配制为具有已知拓扑的有向图,其目的是保持源和目标节点之间的恒定和弹性流。通过预测性和弹性Q学习算法评估了通过该网络的最佳途径,该算法考虑了有关故障的不规则操作的历史数据,以及人类操作员的反馈,这些数据被认为具有有关网络状态的额外信息,这些信息可能会涉及攻击。为了展示我们的方法,我们利用弗吉尼亚州阿灵顿县的匿名数据来计算智能水供应系统的预测性和弹性调度策略,同时避免(i)根据人类操作员(II)(ii)所检测到的所有漏洞或其他故障的人(II)攻击的所有位置。该方法既包含了人类的适应性,又结合了机器在水分配中实现最佳实施遏制和恢复动作的计算能力。
We consider the problem of recommending resilient and predictive actions for an IoT network in the presence of faulty components, considering the presence of human operators manipulating the information of the environment the agent sees for containment purposes. The IoT network is formulated as a directed graph with a known topology whose objective is to maintain a constant and resilient flow between a source and a destination node. The optimal route through this network is evaluated via a predictive and resilient Q-learning algorithm which takes into account historical data about irregular operation, due to faults, as well as the feedback from the human operators that are considered to have extra information about the status of the network concerning locations likely to be targeted by attacks. To showcase our method, we utilize anonymized data from Arlington County, Virginia, to compute predictive and resilient scheduling policies for a smart water supply system, while avoiding (i) all the locations indicated to be attacked according to human operators (ii) as many as possible neighborhoods detected to have leaks or other faults. This method incorporates both the adaptability of the human and the computation capability of the machine to achieve optimal implementation containment and recovery actions in water distribution.