Emergence of Pragmatics from Referential Game between Theory of Mind Agents

Paper title

Emergence of Pragmatics from Referential Game between Theory of Mind Agents

Authors

Luyao Yuan, Zipeng Fu, Jingyue Shen, Lu Xu, Junhong Shen, Song-Chun Zhu

Abstract

Pragmatics studies how context contributes to the meaning of language. In human communication, language is never interpreted out of context, and sentences usually convey more information than their literal meanings. However, this mechanism is missing in most multi-agent systems, which restricts communication efficiency and the capability for human-agent interaction. In this paper, we propose an algorithm with which agents can spontaneously learn the ability to "read between the lines" without any explicit hand-designed rules. We integrate the theory of mind (ToM) into a cooperative multi-agent pedagogical situation and propose an adaptive reinforcement learning (RL) algorithm to develop a communication protocol. ToM is a profound cognitive science concept, which holds that people regularly reason about others' mental states, including beliefs, goals, and intentions, to obtain a performance advantage in competition, cooperation, or coalition. With this ability, agents treat language not only as messages but also as rational acts reflecting others' hidden states. Our experiments demonstrate the advantage of pragmatic protocols over non-pragmatic protocols. We also show that the teaching complexity under the pragmatic protocol empirically approximates the recursive teaching dimension (RTD).
