论文标题
局部敏感的散列,并具有扩展的差异隐私
Locality Sensitive Hashing with Extended Differential Privacy
论文作者
论文摘要
扩展的差异隐私是使用一般度量对标准差异隐私(DP)的概括,已广泛研究以提供严格的隐私保证,同时保持高公用事业。但是,关于扩展DP的现有作品仅限于少数指标,例如欧几里得指标。因此,他们只有少量的应用程序,例如基于位置的服务和文档处理。在本文中,我们提出了几种提供不同度量的DP的机制:角度距离(或余弦距离)。我们的机制基于局部敏感的散列(LSH),可应用于角度距离,并且可以很好地适用于高维空间中的个人数据。我们从理论上分析了机制的隐私属性,并通过考虑LSH仅保留原始度量大约保留的输入数据的扩展DP。我们将机制应用于基于本地模型中具有角距离的高维个人数据的朋友匹配,并使用两个真实数据集评估我们的机制。我们表明,LDP需要非常大的隐私预算,并且会议在此应用程序中不起作用。然后,我们证明我们的机制使朋友可以根据扩展的DP匹配高公用事业和严格的隐私权。
Extended differential privacy, a generalization of standard differential privacy (DP) using a general metric, has been widely studied to provide rigorous privacy guarantees while keeping high utility. However, existing works on extended DP are limited to few metrics, such as the Euclidean metric. Consequently, they have only a small number of applications, such as location-based services and document processing. In this paper, we propose a couple of mechanisms providing extended DP with a different metric: angular distance (or cosine distance). Our mechanisms are based on locality sensitive hashing (LSH), which can be applied to the angular distance and work well for personal data in a high-dimensional space. We theoretically analyze the privacy properties of our mechanisms, and prove extended DP for input data by taking into account that LSH preserves the original metric only approximately. We apply our mechanisms to friend matching based on high-dimensional personal data with angular distance in the local model, and evaluate our mechanisms using two real datasets. We show that LDP requires a very large privacy budget and that RAPPOR does not work in this application. Then we show that our mechanisms enable friend matching with high utility and rigorous privacy guarantees based on extended DP.