有效地量化社交网络中的匹配风险

论文标题

有效地量化社交网络中的匹配风险

Efficient Quantification of Profile Matching Risk in Social Networks

论文作者

Halimi, Anisa, Ayday, Erman

论文摘要

在当今相互联系的数字世界中，匿名数据共享变得越来越具有挑战性，尤其是对于拥有匿名和确定在线活动的个人而言。当今此类数据共享平台最突出的例子是在线社交网络（OSN）。许多人在不同的OSN中具有多个配置文件，包括匿名和确定的OSN（取决于OSN的性质）。在这里，隐私威胁与个人资料匹配：如果攻击者将个人的匿名概况与其真实身份联系起来，则可以获得对隐私敏感的信息，这些信息可能会带来严重的后果，例如歧视或勒索。因此，量化并向OSN用户展示此隐私风险的程度非常重要。实时风险量化的现有尝试对OSN中的匹配模型匹配不足，计算效率低下。因此，在这项工作中，我们开发了算法，以有效地对OSN中的匹配攻击进行有效建模和量化匹配攻击，以此作为实时隐私风险量化的一步。为此，我们使用图形对配置文件匹配问题进行了建模，并开发基于信念的传播（BP）算法，以与最先进的ART相比，以更有效和准确的方式解决此问题。我们在三个现实生活数据集（包括来自四个不同社交网络的数据）上评估了所提出的框架，并显示了如何有效地匹配不同OSN中用户的配置文件，并且概率很高。我们表明，所提出的模型生成在用户对数方面具有线性复杂性，这比最先进的（具有立方复杂性）要高得多。此外，与最先进的相比，它提供了可比的准确性，精度和召回。

Anonymous data sharing has been becoming more challenging in today's interconnected digital world, especially for individuals that have both anonymous and identified online activities. The most prominent example of such data sharing platforms today are online social networks (OSNs). Many individuals have multiple profiles in different OSNs, including anonymous and identified ones (depending on the nature of the OSN). Here, the privacy threat is profile matching: if an attacker links anonymous profiles of individuals to their real identities, it can obtain privacy-sensitive information which may have serious consequences, such as discrimination or blackmailing. Therefore, it is very important to quantify and show to the OSN users the extent of this privacy risk. Existing attempts to model profile matching in OSNs are inadequate and computationally inefficient for real-time risk quantification. Thus, in this work, we develop algorithms to efficiently model and quantify profile matching attacks in OSNs as a step towards real-time privacy risk quantification. For this, we model the profile matching problem using a graph and develop a belief propagation (BP)-based algorithm to solve this problem in a significantly more efficient and accurate way compared to the state-of-the-art. We evaluate the proposed framework on three real-life datasets (including data from four different social networks) and show how users' profiles in different OSNs can be matched efficiently and with high probability. We show that the proposed model generation has linear complexity in terms of number of user pairs, which is significantly more efficient than the state-of-the-art (which has cubic complexity). Furthermore, it provides comparable accuracy, precision, and recall compared to state-of-the-art.

下载PDF全文

下载文献需遵守相关版权规定

论文标题