论文标题
SERDAB:用于分区多个飞地的神经网络计算的物联网框架
Serdab: An IoT Framework for Partitioning Neural Networks Computation across Multiple Enclaves
论文作者
论文摘要
深度神经网络(DNN)和Edge计算的最新进展使得可以自动通过包括边缘设备,靠近视频源的层级设备以及远程云计算资源来自动分析家庭/安全摄像机的视频流。但是,保留用户通过不同设备的敏感数据的隐私和机密性仍然是大多数用户的关注点。私人用户数据受到恶意攻击者的攻击或内部管理员滥用,他们可能在未经用户明确批准的活动中使用数据。为了应对这一挑战,我们提出了SERDAB,这是一个分布式的编排框架,用于在多个安全的飞地(例如Intel SGX)上部署深度神经网络计算。安全的飞地提供了有关其内部部署的数据/代码隐私的保证。但是,他们有限的硬件资源使它们仅运行整个深神经网络时效率低下。为了弥合这一差距,SERDAB提出了DNN分区策略,以通过多个飞地设备或Enclave设备和其他硬件加速器分配神经网络的层。与在一个飞地中执行整个神经网络相比,我们的分区策略达到了高达4.7倍的速度。
Recent advances in Deep Neural Networks (DNN) and Edge Computing have made it possible to automatically analyze streams of videos from home/security cameras over hierarchical clusters that include edge devices, close to the video source, as well as remote cloud compute resources. However, preserving the privacy and confidentiality of users' sensitive data as it passes through different devices remains a concern to most users. Private user data is subject to attacks by malicious attackers or misuse by internal administrators who may use the data in activities that are not explicitly approved by the user. To address this challenge, we present Serdab, a distributed orchestration framework for deploying deep neural network computation across multiple secure enclaves (e.g., Intel SGX). Secure enclaves provide a guarantee on the privacy of the data/code deployed inside it. However, their limited hardware resources make them inefficient when solely running an entire deep neural network. To bridge this gap, Serdab presents a DNN partitioning strategy to distribute the layers of the neural network across multiple enclave devices or across an enclave device and other hardware accelerators. Our partitioning strategy achieves up to 4.7x speedup compared to executing the entire neural network in one enclave.