论文标题
与云中的Docker容器进行深入学习推断的经验质量计划的经验质量
Differentiate Quality of Experience Scheduling for Deep Learning Inferences with Docker Containers in the Cloud
论文作者
论文摘要
随着大数据驱动的应用程序的普遍性,例如智能手机的面部识别以及Google Ads的量身定制建议,我们正处于一种比以往任何时候都更加智能的生活方式的道路。各种神经网络动力的模型正在其智能的后端运行,以便对用户进行快速响应。支持这些模型需要大量基于云的计算资源,例如CPU和GPU。云提供商通过他们所占据的资源数量向客户收取。客户必须平衡预算和经验质量(例如响应时间)。预算依靠个人企业主,所需的经验质量(QOE)取决于不同应用程序的使用情况。例如,自动驾驶汽车需要实时响应,但是解锁智能手机可以忍受延迟。但是,云提供商未能为其客户提供基于QOE的选项。在本文中,我们提出了DQOES,是针对深度学习推论的经验调度程序的差异化质量。 DQOE接受客户对目标QOE的规格,并动态调整资源以实现其目标。通过广泛的基于云的实验,DQOE表明,与现有系统相比,它可以针对各种QOE安排多个并发作业,并实现多达8倍的满意模型
With the prevalence of big-data-driven applications, such as face recognition on smartphones and tailored recommendations from Google Ads, we are on the road to a lifestyle with significantly more intelligence than ever before. Various neural network powered models are running at the back end of their intelligence to enable quick responses to users. Supporting those models requires lots of cloud-based computational resources, e.g., CPUs and GPUs. The cloud providers charge their clients by the amount of resources that they occupy. Clients have to balance the budget and quality of experiences (e.g., response time). The budget leans on individual business owners, and the required Quality of Experience (QoE) depends on usage scenarios of different applications. For instance, an autonomous vehicle requires an real-time response, but unlocking your smartphone can tolerate delays. However, cloud providers fail to offer a QoE-based option to their clients. In this paper, we propose DQoES, differentiated quality of experience scheduler for deep learning inferences. DQoES accepts clients' specifications on targeted QoEs, and dynamically adjusts resources to approach their targets. Through the extensive cloud-based experiments, DQoES demonstrates that it can schedule multiple concurrent jobs with respect to various QoEs and achieve up to 8x times more satisfied models when compared to the existing system