论文标题
部分可观测时空混沌系统的无模型预测
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
论文作者
论文摘要
最初在神经机器翻译(NMT)中首次提出的非自动回旋(NAR)一代来加快推理,在机器学习和自然语言处理社区中引起了很多关注。尽管NAR的生成可以显着加速机器翻译的推理速度,但与同行,自回归(AR)的一代相比,加速度是以牺牲翻译精度为代价的。近年来,已经设计/提出了许多新的模型和算法,以弥合NAR生成与AR生成之间的准确性差距。在本文中,我们进行了一项系统的调查,并对来自不同方面的各种非自动回归翻译(NAT)模型进行了比较和讨论。具体而言,我们将NAT的努力分为几个组,包括数据操纵,建模方法,训练标准,解码算法以及预先训练的模型的好处。此外,我们简要介绍了NAR模型超出机器翻译的其他应用,例如语法错误校正,文本摘要,文本样式传输,对话,语义解析,自动语音识别等。此外,我们还讨论了未来探索的潜在方向,包括释放KD的依赖性,合理的培训目标,NAR的预培训以及更广泛的应用等。我们希望这项调查可以帮助研究人员捕获NAR生成的最新进展,激发高级NAR模型和算法的设计,以及为其应用程序选择适当的解决方案。该调查的网页位于\ url {https://github.com/litterbrother-xiao/overview-ob-non-autoregresctions-applications}。
Non-autoregressive (NAR) generation, which is first proposed in neural machine translation (NMT) to speed up inference, has attracted much attention in both machine learning and natural language processing communities. While NAR generation can significantly accelerate inference speed for machine translation, the speedup comes at the cost of sacrificed translation accuracy compared to its counterpart, autoregressive (AR) generation. In recent years, many new models and algorithms have been designed/proposed to bridge the accuracy gap between NAR generation and AR generation. In this paper, we conduct a systematic survey with comparisons and discussions of various non-autoregressive translation (NAT) models from different aspects. Specifically, we categorize the efforts of NAT into several groups, including data manipulation, modeling methods, training criterion, decoding algorithms, and the benefit from pre-trained models. Furthermore, we briefly review other applications of NAR models beyond machine translation, such as grammatical error correction, text summarization, text style transfer, dialogue, semantic parsing, automatic speech recognition, and so on. In addition, we also discuss potential directions for future exploration, including releasing the dependency of KD, reasonable training objectives, pre-training for NAR, and wider applications, etc. We hope this survey can help researchers capture the latest progress in NAR generation, inspire the design of advanced NAR models and algorithms, and enable industry practitioners to choose appropriate solutions for their applications. The web page of this survey is at \url{https://github.com/LitterBrother-Xiao/Overview-of-Non-autoregressive-Applications}.