为什么受限的神经语言产生特别具有挑战性？

论文标题

为什么受限的神经语言产生特别具有挑战性？

Why is constrained neural language generation particularly challenging?

论文作者

Garbacea, Cristina, Mei, Qiaozhu

论文摘要

深度神经语言模型的最新进展与大规模数据集的能力相结合，加速了自然语言生成系统的发展，这些系统在多种任务和应用程序上下文中产生流利而连贯的文本（在各种成功程度上）。但是，为所需的用户控制这些模型的输出仍然是一个开放的挑战。这不仅对于自定义生成语言的内容和样式至关重要，而且对于他们在现实世界中的安全可靠部署至关重要。我们对受约束神经语言产生的新兴主题进行了广泛的调查，在该主题中，我们通过区分条件和约束（后者是在输出文本上而不是输入的可测试条件），当前的文本生成任务，并查看现有的方法和评估指标来确定文本生成任务的可检验条件和评估指标来正式定义和分类自然语言生成问题。我们的目的是强调这个新兴领域的最新进展和趋势，以告知最有希望的方向和局限性，以推动受约束神经语言生成研究的最新作用。

Recent advances in deep neural language models combined with the capacity of large scale datasets have accelerated the development of natural language generation systems that produce fluent and coherent texts (to various degrees of success) in a multitude of tasks and application contexts. However, controlling the output of these models for desired user and task needs is still an open challenge. This is crucial not only to customizing the content and style of the generated language, but also to their safe and reliable deployment in the real world. We present an extensive survey on the emerging topic of constrained neural language generation in which we formally define and categorize the problems of natural language generation by distinguishing between conditions and constraints (the latter being testable conditions on the output text instead of the input), present constrained text generation tasks, and review existing methods and evaluation metrics for constrained text generation. Our aim is to highlight recent progress and trends in this emerging field, informing on the most promising directions and limitations towards advancing the state-of-the-art of constrained neural language generation research.

下载PDF全文

下载文献需遵守相关版权规定

论文标题