论文标题
谁失踪了?表征不同人口群体参与韩国全国每日对话语料库的参与
Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus
论文作者
论文摘要
对话语料库对于构建交互式AI应用程序至关重要。但是,此类语料库中参与者的人口统计信息在很大程度上是由于许多语料库中缺乏个人数据而被忽视的。在这项工作中,我们分析了由国家韩国语言研究所(NIKL)建立的全国性日常对话语料库,以表征不同人口统计学(年龄和性别)群体在语料库中的参与。
A conversation corpus is essential to build interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored mainly due to the lack of individual data in many corpora. In this work, we analyze a Korean nationwide daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize the participation of different demographic (age and sex) groups in the corpus.