论文标题
NL2GDPR:自然语言自动开发符合GDPR的Android应用功能
NL2GDPR: Automatically Develop GDPR Compliant Android Application Features from Natural Language
论文作者
论文摘要
最近的隐私泄漏事件和更严格的政策法规要求公司和移动应用程序的合规标准更高。但是,这些义务还对应用程序开发人员构成了包含各种观点,活动和角色的这些法规,尤其是对于在此问题或资源有限的小型公司和开发人员方面构成了重大挑战。为了解决这些障碍,我们开发了一个自动工具NL2GDPR,该工具可以从开发人员的自然语言描述中制定策略,同时还确保应用程序的功能符合通用数据保护法规(GDPR)。 NL2GDPR是通过利用Baidu认知计算实验室开发的信息提取工具OIA(开放信息注释)来开发的。 核心,NL2GDPR是一个以隐私为中心的信息提取模型,附有GDPR策略查找器和策略生成器。我们进行了一项全面的研究,以掌握提取以隐私为中心的信息和制定隐私政策的挑战,同时利用这项特定任务的优化。借助NL2GDPR,我们可以在正确识别与个人数据存储,过程和共享类型相关的GDPR策略方面获得92.9%,95.2%和98.4%的精度。据我们所知,NL2GDPR是第一个允许开发人员自动生成符合GDPR的策略的工具,只需要输入自然语言来描述应用程序功能。请注意,其他非GDPR相关功能可能与生成的功能集成以构建复杂的应用程序。
The recent privacy leakage incidences and the more strict policy regulations demand a much higher standard of compliance for companies and mobile apps. However, such obligations also impose significant challenges on app developers for complying with these regulations that contain various perspectives, activities, and roles, especially for small companies and developers who are less experienced in this matter or with limited resources. To address these hurdles, we develop an automatic tool, NL2GDPR, which can generate policies from natural language descriptions from the developer while also ensuring the app's functionalities are compliant with General Data Protection Regulation (GDPR). NL2GDPR is developed by leveraging an information extraction tool, OIA (Open Information Annotation), developed by Baidu Cognitive Computing Lab. At the core, NL2GDPR is a privacy-centric information extraction model, appended with a GDPR policy finder and a policy generator. We perform a comprehensive study to grasp the challenges in extracting privacy-centric information and generating privacy policies, while exploiting optimizations for this specific task. With NL2GDPR, we can achieve 92.9%, 95.2%, and 98.4% accuracy in correctly identifying GDPR policies related to personal data storage, process, and share types, respectively. To the best of our knowledge, NL2GDPR is the first tool that allows a developer to automatically generate GDPR compliant policies, with only the need of entering the natural language for describing the app features. Note that other non-GDPR-related features might be integrated with the generated features to build a complex app.