论文标题

化学引起的疾病关系提取与依赖信息和先验知识

Chemical-induced Disease Relation Extraction with Dependency Information and Prior Knowledge

论文作者

Zhou, Huiwei, Ning, Shixian, Yang, Yunlong, Liu, Zhuang, Lang, Chengkun, Lin, Yingyu

论文摘要

化学疾病关系(CDR)提取对生物医学研究和医疗保健的各个领域至关重要。如今,许多大规模的生物医学知识库(KB),其中包含有关实体对及其关系的三元组。 KB是生物医学关系提取的重要资源。但是,先前的研究对先验知识几乎没有关注。另外,依赖树包含重要的句法和语义信息,这有助于改善关系提取。因此,如何有效使用它也值得研究。在本文中,我们提出了一个新型的卷积注意网络(CAN),以进行CDR提取。首先,我们在句子中提取化学和疾病对之间的最短依赖性路径(SDP),其中包括一系列单词,依赖方向和依赖关系标签。然后在SDP上执行卷积操作,以产生深层的语义依赖性特征。之后,采用了一种注意机制来学习与从KBS学到的知识表示相关的每个语义依赖向量的重要性/权重。最后,为了结合依赖性信息和先验知识,加权语义依赖性表示和知识表示的串联被馈送到SoftMax层以进行分类。生物依据V CDR数据集的实验表明,我们的方法与最先进的系统实现了可比的性能,并且依赖性信息和先验知识在CDR提取任务中都起着重要作用。

Chemical-disease relation (CDR) extraction is significantly important to various areas of biomedical research and health care. Nowadays, many large-scale biomedical knowledge bases (KBs) containing triples about entity pairs and their relations have been built. KBs are important resources for biomedical relation extraction. However, previous research pays little attention to prior knowledge. In addition, the dependency tree contains important syntactic and semantic information, which helps to improve relation extraction. So how to effectively use it is also worth studying. In this paper, we propose a novel convolutional attention network (CAN) for CDR extraction. Firstly, we extract the shortest dependency path (SDP) between chemical and disease pairs in a sentence, which includes a sequence of words, dependency directions, and dependency relation tags. Then the convolution operations are performed on the SDP to produce deep semantic dependency features. After that, an attention mechanism is employed to learn the importance/weight of each semantic dependency vector related to knowledge representations learned from KBs. Finally, in order to combine dependency information and prior knowledge, the concatenation of weighted semantic dependency representations and knowledge representations is fed to the softmax layer for classification. Experiments on the BioCreative V CDR dataset show that our method achieves comparable performance with the state-of-the-art systems, and both dependency information and prior knowledge play important roles in CDR extraction task.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源