论文标题

通过故障转移的在线需求安排

Online Demand Scheduling with Failovers

论文作者

Mellou, Konstantina, Molinaro, Marco, Zhou, Rudy

论文摘要

在云计算应用程序的启发下,我们研究了如何最佳部署新硬件的问题。为了模拟大型数据中心中观察到的情况,我们引入了与故障转移问题的在线需求计划。有$ m $相同的设备具有容量限制。需求是一对一的,要对设备故障进行鲁棒性,需要将其分配给一对设备。当设备故障(在故障转移方案中)时,分配给其配对设备的每个需求都将重新路由(现在可以以增加的容量运行)。目的是将需求分配给设备,以最大程度地利用总体利用率以及这些新颖的故障转移约束。这些后一种约束引入了经典作业问题(例如多重背包问题和AdWords)中不存在的新决策权衡。 在最差的模型中,我们设计了确定性的$ \ of frac {1} {2} $ - 竞争算法,并证明这基本上是紧密的。为了避免这种恒定的因素损失,在大云提供商的背景下,我们考虑了随机到达模型,所有需求都出现在其中。来自未知分布。在此模型中,我们设计了一种算法,该算法可实现亚线性添加剂后悔(即随着OPT或$ M $的增加,乘法竞争比率为$ 1 $)。这需要结合不同技术的组合,包括配置LP,具有非平凡的后处理步骤以及Rhee和Talagrand引入的在线单调匹配过程。

Motivated by cloud computing applications, we study the problem of how to optimally deploy new hardware subject to both power and robustness constraints. To model the situation observed in large-scale data centers, we introduce the Online Demand Scheduling with Failover problem. There are $m$ identical devices with capacity constraints. Demands come one-by-one and, to be robust against a device failure, need to be assigned to a pair of devices. When a device fails (in a failover scenario), each demand assigned to it is rerouted to its paired device (which may now run at increased capacity). The goal is to assign demands to the devices to maximize the total utilization subject to both the normal capacity constraints as well as these novel failover constraints. These latter constraints introduce new decision tradeoffs not present in classic assignment problems such as the Multiple Knapsack problem and AdWords. In the worst-case model, we design a deterministic $\approx \frac{1}{2}$-competitive algorithm, and show this is essentially tight. To circumvent this constant-factor loss, which in the context of big cloud providers represents substantial capital losses, we consider the stochastic arrival model, where all demands come i.i.d. from an unknown distribution. In this model we design an algorithm that achieves a sub-linear additive regret (i.e. as OPT or $m$ increases, the multiplicative competitive ratio goes to $1$). This requires a combination of different techniques, including a configuration LP with a non-trivial post-processing step and an online monotone matching procedure introduced by Rhee and Talagrand.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源