论文标题
化名论坛用户对任务的批处理:匿名性妥协和保护
Batching of Tasks by Users of Pseudonymous Forums: Anonymity Compromise and Protection
论文作者
论文摘要
在许多论坛上,人们会在假名下参加。一个例子是同行评审,其中任何论文的审阅者身份都是机密的。当参加这些论坛时,人们经常从事“批处理”:几乎同时执行多个相关任务(例如,对多个论文发表评论)。我们的经验分析表明,在我们考虑$ \ unicode {x2013} $对等评论和Wikipedia编辑的两个应用中,批处理很常见。在本文中,我们识别并解决了通过链接批处理任务引起的脱名单的风险。为了防止连锁攻击,我们采取了将延迟延迟到批处理任务的发布时间的方法。我们首先表明,在某些自然假设下,没有延迟机制可以提供有意义的差异隐私保证。因此,我们建议对差分隐私的“单方面”制定,以防止连锁攻击。我们设计了一种机制,将事件添加为零膨胀的统一延迟,并表明它可以保留隐私。我们证明,这种噪声分布实际上是最大程度地减少每个事件的独立噪声的预期延迟的最佳选择,从而确立了批处理事件和未捕获事件的预期延迟之间权衡的帕累托前沿。最后,我们对Wikipedia和比特币数据进行了一系列实验,这些实验证实了我们算法在混淆批处理方面的实际实用性,而无需将繁重的延迟引入系统。
There are a number of forums where people participate under pseudonyms. One example is peer review, where the identity of reviewers for any paper is confidential. When participating in these forums, people frequently engage in "batching": executing multiple related tasks (e.g., commenting on multiple papers) at nearly the same time. Our empirical analysis shows that batching is common in two applications we consider $\unicode{x2013}$ peer review and Wikipedia edits. In this paper, we identify and address the risk of deanonymization arising from linking batched tasks. To protect against linkage attacks, we take the approach of adding delay to the posting time of batched tasks. We first show that under some natural assumptions, no delay mechanism can provide a meaningful differential privacy guarantee. We therefore propose a "one-sided" formulation of differential privacy for protecting against linkage attacks. We design a mechanism that adds zero-inflated uniform delay to events and show it can preserve privacy. We prove that this noise distribution is in fact optimal in minimizing expected delay among mechanisms adding independent noise to each event, thereby establishing the Pareto frontier of the trade-off between the expected delay for batched and unbatched events. Finally, we conduct a series of experiments on Wikipedia and Bitcoin data that corroborate the practical utility of our algorithm in obfuscating batching without introducing onerous delay to a system.