论文标题

GDPR符合治疗师患者dialogues的收集

GDPR Compliant Collection of Therapist-Patient-Dialogues

论文作者

Mayer, Tobias, Warikoo, Neha, Grimm, Oliver, Reif, Andreas, Gurevych, Iryna

论文摘要

根据世界卫生组织(WHO)提供的全球疾病负担清单,精神障碍是最令人沮丧的疾病之一。近年来,为了提高诊断和治疗效果,研究人员试图识别单个生物标志物。但是,收集神经生物学数据是昂贵且耗时的。治疗师患者对话是另一个潜在的信息来源,它已经是临床常规的一部分。尽管有一些开创性的作品调查了语言作为各种治疗参数的预测因子的作用,例如患者 - 治疗师联盟,但没有大规模研究。进行这些研究的主要障碍是可用的数据集,这是训练机器学习模型所需的。尽管这些对话是临床医生日常工作的一部分,但通常会受到各种道德(数据使用的目的),法律(数据隐私)和技术(数据格式)限制的限制。这些局限性中的某些局限性特定在治疗对话领域,例如匿名的难度增加或录音的转录。在本文中,我们详细阐述了我们在欧盟的一般数据隐私监管下,在精神病学诊所开始收集治疗师患者对话所面临的挑战,其目标是将数据用于自然语言处理(NLP)研究。我们概述了我们的过程中的每个步骤,并指出了激励该领域进一步研究的潜在陷阱。

According to the Global Burden of Disease list provided by the World Health Organization (WHO), mental disorders are among the most debilitating disorders.To improve the diagnosis and the therapy effectiveness in recent years, researchers have tried to identify individual biomarkers. Gathering neurobiological data however, is costly and time-consuming. Another potential source of information, which is already part of the clinical routine, are therapist-patient dialogues. While there are some pioneering works investigating the role of language as predictors for various therapeutic parameters, for example patient-therapist alliance, there are no large-scale studies. A major obstacle to conduct these studies is the availability of sizeable datasets, which are needed to train machine learning models. While these conversations are part of the daily routine of clinicians, gathering them is usually hindered by various ethical (purpose of data usage), legal (data privacy) and technical (data formatting) limitations. Some of these limitations are particular to the domain of therapy dialogues, like the increased difficulty in anonymisation, or the transcription of the recordings. In this paper, we elaborate on the challenges we faced in starting our collection of therapist-patient dialogues in a psychiatry clinic under the General Data Privacy Regulation of the European Union with the goal to use the data for Natural Language Processing (NLP) research. We give an overview of each step in our procedure and point out the potential pitfalls to motivate further research in this field.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源