道德，参与规则和AI：使用大型变压器模型的神经叙事映射

论文标题

道德，参与规则和AI：使用大型变压器模型的神经叙事映射

Ethics, Rules of Engagement, and AI: Neural Narrative Mapping Using Large Transformer Language Models

论文作者

Feldman, Philip, Dant, Aaron, Rosenbluth, David

论文摘要

确定军事部队是否正确理解命令并正确执行的问题是在整个历史上都陷入困境的军事规划师。 OpenAI的GPT系列等高级语言模型的出现为解决此问题提供了新的可能性。本文提出了一种利用大语言模型的叙事输出的机制，并产生在GPT-3等模型中潜在的关系的图表或“地图”。由此产生的“神经叙事地图”（NNMS）旨在提供对模型中信息，意见和信念的组织的见解，这又提供了在物理距离的背景下理解意图和响应的手段。本文讨论了一般映射信息空间的问题，然后在OpenAI的GPT-3语言模型的背景下提出了该概念的具体实现，以确定下属在高风险情况下是否遵循指挥官的意图。 NNM中的下属的位置允许一种新颖的能力来评估下属相对于指挥官的意图。我们表明，这不仅可以确定它们是否在叙事空间附近，还可以确定它们的定向以及它们所在的“轨迹”。我们的结果表明，我们的方法能够生成高质量的地图，并展示了更广泛地评估意图的新方法。

The problem of determining if a military unit has correctly understood an order and is properly executing on it is one that has bedeviled military planners throughout history. The advent of advanced language models such as OpenAI's GPT-series offers new possibilities for addressing this problem. This paper presents a mechanism to harness the narrative output of large language models and produce diagrams or "maps" of the relationships that are latent in the weights of such models as the GPT-3. The resulting "Neural Narrative Maps" (NNMs), are intended to provide insight into the organization of information, opinion, and belief in the model, which in turn provide means to understand intent and response in the context of physical distance. This paper discusses the problem of mapping information spaces in general, and then presents a concrete implementation of this concept in the context of OpenAI's GPT-3 language model for determining if a subordinate is following a commander's intent in a high-risk situation. The subordinate's locations within the NNM allow a novel capability to evaluate the intent of the subordinate with respect to the commander. We show that is is possible not only to determine if they are nearby in narrative space, but also how they are oriented, and what "trajectory" they are on. Our results show that our method is able to produce high-quality maps, and demonstrate new ways of evaluating intent more generally.

下载PDF全文

下载文献需遵守相关版权规定

论文标题