论文标题

猜测成本:分布式存储中数据修复的界限和应用

Guessing Cost: Bounds and Applications to Data Repair in Distributed Storage

论文作者

Arslan, Suayb S., Haytaoglu, Elif

论文摘要

猜测是指准确猜测随机变量所需的最小试验数量的分布。在这项研究中,引入了称为猜测成本的猜测(也称为猜测成本)的猜测,并引入了找到$ρ$ the猜测成本的最佳策略,用于在有限的设置上定义的随机变量,每个选择都与正有有限的成本值相关联(单位成本相对应)。此外,我们在猜测成本瞬间的对数上渐近地驾驶上下界限。与以前关于猜测的研究类似,在猜测成本的时刻建立了界限,量化了正确识别未知选择所需的猜测的累积成本,并根据Rényi的熵表示。此外,引入了新的随机变量,以在猜测成本与猜测工作之间建立联系,从而导致引起的策略。建立这种隐式联系有助于我们获得了非反应区域的改善界限。结果,我们通过考虑一系列具有不同成本分布的独立随机变量来建立在​​猜测成本的时刻,从猜测成本的时刻建立猜测成本指数。最后,通过对原始问题进行了轻微的修改,这些结果可用于界定由基站支持并受到两部分图形代码保护的分布式数据存储系统的整体修复带宽。

The guesswork refers to the distribution of the minimum number of trials needed to guess a realization of a random variable accurately. In this study, a non-trivial generalization of the guesswork called guessing cost (also referred to as cost of guessing) is introduced, and an optimal strategy for finding the $ρ$-th moment of guessing cost is provided for a random variable defined on a finite set whereby each choice is associated with a positive finite cost value (unit cost corresponds to the original guesswork). Moreover, we drive asymptotically tight upper and lower bounds on the logarithm of guessing cost moments. Similar to previous studies on the guesswork, established bounds on the moments of guessing cost quantify the accumulated cost of guesses required for correctly identifying the unknown choice and are expressed in terms of Rényi's entropy. Moreover, new random variables are introduced to establish connections between the guessing cost and the guesswork, leading to induced strategies. Establishing this implicit connection helped us obtain improved bounds for the non-asymptotic region. As a consequence, we establish the guessing cost exponent in terms of Rényi entropy rate on the moments of the guessing cost using the optimal strategy by considering a sequence of independent random variables with different cost distributions. Finally, with slight modifications to the original problem, these results are shown to be applicable for bounding the overall repair bandwidth for distributed data storage systems backed up by base stations and protected by bipartite graph codes.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源