论文标题
聊天聊天增强了融合动机对话系统的以任务为导向的对话公司
A Chit-Chats Enhanced Task-Oriented Dialogue Corpora for Fuse-Motive Conversation Systems
论文作者
论文摘要
建立智能对话系统的目标在很大程度上是在两个动机下分别追求的:面向任务的对话(TOD)系统和Chit-Chat(CC)的开放域系统。尽管以前的TOD对话系统在基准的测试集中效果很好,但在实践中暴露于自然情景时,它们会导致不良的失败,在这种情况下,用户的话语可能具有高动力多样性,从而在多扭转互动中融合TOD和CC。由于工业TOD系统应该能够与TOD和CC动机之间的用户交谈,因此构建包含TOD或CC的Fuse-Motive对话数据集很重要。大多数先前的工作都依靠人群工人来收集和注释大型数据集,并且仅限于英语设置。相反,我们的工作以更有效的方式解决了这个问题,并发布了称为CCET(中文聊天增强任务)的多转话数据集。同时,我们还提出了一系列的融合功能对话方式形式化方法,以及对CC话语集成的TOD会话的几个评估指标。
The goal of building intelligent dialogue systems has largely been separately pursued under two motives: task-oriented dialogue (TOD) systems, and open-domain systems for chit-chat (CC). Although previous TOD dialogue systems work well in the testing sets of benchmarks, they would lead to undesirable failure when being exposed to natural scenarios in practice, where user utterances can be of high motive-diversity that fusing both TOD and CC in multi-turn interaction. Since an industrial TOD system should be able to converse with the user between TOD and CC motives, constructing a fuse-motive dialogue dataset that contains both TOD or CC is important. Most prior work relies on crowd workers to collect and annotate large scale dataset and is restricted to English language setting. Our work, on the contrary, addresses this problem in a more effective way and releases a multi-turn dialogues dataset called CCET (Chinese Chat-Enhanced-Task). Meanwhile, we also propose a line of fuse-motive dialogues formalization approach, along with several evaluation metrics for TOD sessions that are integrated by CC utterances.