论文标题
电影中几乎没有射击的角色理解,作为对元理论的元学习的评估
Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind
论文作者
论文摘要
在阅读故事时,人类可以通过一些观察来迅速理解新的虚构人物,主要是通过对他们已经认识的虚构和真实的人进行类比。这反映了人类对角色的精神状态的推断,即伴随理论(汤姆)的少量和元学习本质,在现有研究中很大程度上被忽略了。我们通过一个新颖的NLP数据集Tom-In-AMC填补了这一空白,这是机器对TOM的元学习的首次评估,这是在现实的叙事理解场景中。我们的数据集由约1,000个解析的电影脚本组成,每个脚本都对应于几个弹奏的角色理解任务,该任务需要模型,以模仿人类在新电影中使用一些启动场景的快速消化角色的能力。 我们提出了一种新颖的TOM提示方法,旨在明确评估多个Tom维度的影响。它超过了现有的基线模型,强调了为我们的任务建模多个TOM维度的重要性。我们广泛的人类研究验证了人类能够通过基于以前看到的电影来推断角色的心理状态来解决我们的问题。相比之下,我们的系统基于最先进的大语言模型(GPT-4)或元学习算法落后于20%,突显了现有方法的TOM功能的明显限制。
When reading a story, humans can quickly understand new fictional characters with a few observations, mainly by drawing analogies to fictional and real people they already know. This reflects the few-shot and meta-learning essence of humans' inference of characters' mental states, i.e., theory-of-mind (ToM), which is largely ignored in existing research. We fill this gap with a novel NLP dataset, ToM-in-AMC, the first assessment of machines' meta-learning of ToM in a realistic narrative understanding scenario. Our dataset consists of ~1,000 parsed movie scripts, each corresponding to a few-shot character understanding task that requires models to mimic humans' ability of fast digesting characters with a few starting scenes in a new movie. We propose a novel ToM prompting approach designed to explicitly assess the influence of multiple ToM dimensions. It surpasses existing baseline models, underscoring the significance of modeling multiple ToM dimensions for our task. Our extensive human study verifies that humans are capable of solving our problem by inferring characters' mental states based on their previously seen movies. In comparison, our systems based on either state-of-the-art large language models (GPT-4) or meta-learning algorithms lags >20% behind, highlighting a notable limitation in existing approaches' ToM capabilities.