论文标题
EXEM:使用深度学习方法使用主导集理论嵌入专家
ExEm: Expert Embedding using dominating set theory with deep learning approaches
论文作者
论文摘要
协作网络是一个社交网络,由专家组成,他们相互合作以实现特殊目标。分析该网络会产生有关这些专家及其学科领域的专业知识的有意义的信息。为了进行分析,图形嵌入技术已成为一种有效且有前途的工具。图形嵌入尝试表示图形节点为低维矢量。在本文中,我们提出了一种称为EXEM的图形嵌入方法,该方法使用主导集理论和深度学习方法来捕获节点表示。 EXEM发现协作网络的主导节点,并构建了至少两个主导节点的智能随机步行。一个主导的节点应出现在采样的每个路径的开头,以表征本地社区。此外,第二个主导节点反映了全局结构信息。为了学习节点嵌入,EXEM利用了三种嵌入方法,包括Word2Vec,fastText和这两个的串联。最终结果是专家的低维媒介,称为专家嵌入。提取的专家嵌入可以应用于许多应用程序。为了将这些嵌入到专家推荐系统中,我们引入了一种新颖的策略,该策略使用专家向量计算专家的分数并推荐专家。最后,我们进行了广泛的实验,以通过评估其对普通数据集的多标签分类,链接预测和建议任务的效果来验证EXEM的有效性,以及通过爬行庞大的作者Scopus概况而形成的收集数据。实验表明,在密集网络中,尤其是基准的表现优于基准。
A collaborative network is a social network that is comprised of experts who cooperate with each other to fulfill a special goal. Analyzing this network yields meaningful information about the expertise of these experts and their subject areas. To perform the analysis, graph embedding techniques have emerged as an effective and promising tool. Graph embedding attempts to represent graph nodes as low-dimensional vectors. In this paper, we propose a graph embedding method, called ExEm, that uses dominating-set theory and deep learning approaches to capture node representations. ExEm finds dominating nodes of the collaborative network and constructs intelligent random walks that comprise of at least two dominating nodes. One dominating node should appear at the beginning of each path sampled to characterize the local neighborhoods. Moreover, the second dominating node reflects the global structure information. To learn the node embeddings, ExEm exploits three embedding methods including Word2vec, fastText and the concatenation of these two. The final result is the low-dimensional vectors of experts, called expert embeddings. The extracted expert embeddings can be applied to many applications. In order to extend these embeddings into the expert recommendation system, we introduce a novel strategy that uses expert vectors to calculate experts' scores and recommend experts. At the end, we conduct extensive experiments to validate the effectiveness of ExEm through assessing its performance over the multi-label classification, link prediction, and recommendation tasks on common datasets and our collected data formed by crawling the vast author Scopus profiles. The experiments show that ExEm outperforms the baselines especially in dense networks.