论文标题

数据操纵:通过学习增强和重量的有效实例学习神经对话生成

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

论文作者

Cai, Hengyi, Chen, Hongshen, Song, Yonghao, Zhang, Cheng, Zhao, Xiaofang, Yin, Dawei

论文摘要

当前的最新神经对话模型从数据驱动的范式之后从人类对话中学习。因此,可靠的培训语料库是建立强大且举止良好的对话模型的症结所在。但是,由于人类对话的开放性质,用户生成的培训数据的质量差异很大,而且有效的培训样本通常不足,而嘈杂的样本经常出现。这阻碍了学习这些数据驱动的神经对话模型。因此,有效的对话学习不仅需要更可靠的学习样本,而且还需要更少的嘈杂样本。在本文中,我们提出了一个数据操纵框架,通过增强和突出有效的学习样本并同时降低效率低下样品的效果,从而主动将数据分布重塑为可靠的样本。特别是,数据操纵模型选择性地增加了培训样本,并为每个实例分配了重大权重,以改革培训数据。请注意,提出的数据操作框架是完全数据驱动的,并且可以学习。它不仅可以操纵培训样本来优化对话生成模型,而且还学会通过使用验证样本来提高其操纵技巧。广泛的实验表明,我们的框架可以在各种自动评估指标和人类判断方面提高对话的产生绩效。

Current state-of-the-art neural dialogue models learn from human conversations following the data-driven paradigm. As such, a reliable training corpus is the crux of building a robust and well-behaved dialogue model. However, due to the open-ended nature of human conversations, the quality of user-generated training data varies greatly, and effective training samples are typically insufficient while noisy samples frequently appear. This impedes the learning of those data-driven neural dialogue models. Therefore, effective dialogue learning requires not only more reliable learning samples, but also fewer noisy samples. In this paper, we propose a data manipulation framework to proactively reshape the data distribution towards reliable samples by augmenting and highlighting effective learning samples as well as reducing the effect of inefficient samples simultaneously. In particular, the data manipulation model selectively augments the training samples and assigns an importance weight to each instance to reform the training data. Note that, the proposed data manipulation framework is fully data-driven and learnable. It not only manipulates training samples to optimize the dialogue generation model, but also learns to increase its manipulation skills through gradient descent with validation samples. Extensive experiments show that our framework can improve the dialogue generation performance with respect to various automatic evaluation metrics and human judgments.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源