论文标题

朝着集成和开放的Covid-19数据

Towards Integrated and Open COVID-19 Data

论文作者

Santipantakis, Georgios M., Vouros, George A., Doulkeridis, Christos

论文摘要

受到与Covid-19大流行有关的全球动荡的动机,我们提出了一个基于本体的系统原型,该系统的集成了来自各个国家的国家数据。与共同相关的数据以不同格式,不同时空的粒度和不规则的不同形式发表。因此,这阻碍了联合数据探索和剥削,这可能会导致科学家获得重要的见解,而不必处理繁琐的数据获取和集成任务。在此缺点的促进下,我们提出了一种用于数据获取,基于本体的数据表示和数据转换为RDF的方法,这也使与其他公开可用的数据源相互联系。目前,来自以下欧洲国家的数据已成功整合:奥地利,比利时,法国,德国,希腊,意大利和瑞典。知识库将自动更新,并通过SPARQL端点和直接下载链接向公众使用。此外,我们通过有意义的查询来展示数据集成如何实现时空数据分析和知识发现,而这些查询是不可行的。

Motivated by the global unrest related to the COVID-19 pandemic, we present a system prototype for ontology-based, integration of national data published from various countries. COVID-related data is published from different authorities, in different formats, at varying spatio-temporal granularity, and irregularly. Consequently, this hinders the joint data exploration and exploitation, which could lead scientists to acquire important insights, without having to deal with the cumbersome task of data acquisition and integration. Motivated by this shortcoming, we propose an approach for data acquisition, ontology-based data representation, and data transformation to RDF, which also enables interlinking with other publicly available data sources. Currently, data coming from the following European countries has been successfully integrated: Austria, Belgium, France, Germany, Greece, Italy, and Sweden. The knowledge base is automatically being updated, and it is available to the public through a SPARQL endpoint and a direct download link. Furthermore, we showcase how data integration enables spatio-temporal data analysis and knowledge discovery, by means of meaningful queries that would not be feasible to process otherwise.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源