Paper title
A Metric Suite for Systematic Quality Assessment of Linked Open Data
Paper authors
Paper abstract
Abstract: The vision of the Linked Open Data (LOD) initiative is to provide a distributed model for publishing and meaningfully interlinking open data. The realization of this goal depends strongly on the quality of the data that is published as part of the LOD. This paper focuses on the systematic quality assessment of datasets prior to publication on the LOD cloud. To this end, we identify important quality deficiencies that need to be avoided and/or resolved prior to the publication of a dataset. We then propose a set of metrics to measure these quality deficiencies in a dataset. In this way, our proposed metrics enable the assessment and identification of undesirable quality characteristics of a dataset. This will help publishers filter out low-quality data based on the quality assessment results, which in turn enables data consumers to make better and more informed decisions when using open datasets.