论文标题

尾部实体的学位意识对齐

Degree-Aware Alignment for Entities in Tail

论文作者

Zeng, Weixin, Zhao, Xiang, Wang, Wei, Tang, Jiuyang, Tan, Zhen

论文摘要

实体一致性(EA)是在知识图(kgs)中发现等效实体,该实体桥梁弥合了信息的异质来源并促进知识的整合。现有的EA解决方案主要依靠结构信息来对齐实体,通常是通过kg嵌入。尽管如此,在现实生活中,只有少数实体与他人密切相关,其余的大多数人具有相当稀疏的邻里结构。我们将后者称为长尾实体,并观察到这种现象可以说限制了对EA的结构信息的使用。为了减轻问题,我们为追求优雅的性能进行了重新访问并调查传统的EA管道。对于预先对准,我们建议以相对较弱的结构信息扩大长尾实体,并以串联幂的均值单词嵌入形式使用实体名称信息,通常可用(但被忽略)。为了对齐,在合并结构和名称信号的新型互补框架下,我们将实体学位确定为有效融合两个不同信息来源的重要指导。为此,构思了一个学位感知的共同注意网络,该网络以程度感知的方式动态调整了特征的重要性。为了进行后,我们建议通过使用自信的EA结果作为通过迭代培训作为锚点来补充原始KG的事实。全面的实验评估验证了我们提出的技术的优势。

Entity alignment (EA) is to discover equivalent entities in knowledge graphs (KGs), which bridges heterogeneous sources of information and facilitates the integration of knowledge. Existing EA solutions mainly rely on structural information to align entities, typically through KG embedding. Nonetheless, in real-life KGs, only a few entities are densely connected to others, and the rest majority possess rather sparse neighborhood structure. We refer to the latter as long-tail entities, and observe that such phenomenon arguably limits the use of structural information for EA. To mitigate the issue, we revisit and investigate into the conventional EA pipeline in pursuit of elegant performance. For pre-alignment, we propose to amplify long-tail entities, which are of relatively weak structural information, with entity name information that is generally available (but overlooked) in the form of concatenated power mean word embeddings. For alignment, under a novel complementary framework of consolidating structural and name signals, we identify entity's degree as important guidance to effectively fuse two different sources of information. To this end, a degree-aware co-attention network is conceived, which dynamically adjusts the significance of features in a degree-aware manner. For post-alignment, we propose to complement original KGs with facts from their counterparts by using confident EA results as anchors via iterative training. Comprehensive experimental evaluations validate the superiority of our proposed techniques.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源