Paper title
DALL-E 2 Fails to Reliably Capture Common Syntactic Processes
Paper authors
Paper abstract
Machine intelligence is increasingly being linked to claims about sentience, language processing, and an ability to comprehend and transform natural language into a range of stimuli. We systematically analyze the ability of DALL-E 2 to capture 8 grammatical phenomena pertaining to compositionality that are widely discussed in linguistics and pervasive in human language: binding principles and coreference, passives, word order, coordination, comparatives, negation, ellipsis, and structural ambiguity. Whereas young children routinely master these phenomena, learning systematic mappings between syntax and semantics, DALL-E 2 is unable to reliably infer meanings that are consistent with the syntax. These results challenge recent claims concerning the capacity of such systems to understand human language. We make available the full set of test materials as a benchmark for future testing.