论文标题
关于物理概念的可学习性:神经网络可以理解什么是真实的吗?
On the Learnability of Physical Concepts: Can a Neural Network Understand What's Real?
论文作者
论文摘要
根据深层神经网络生成逼真的合成数据的显着能力,我们重新审视了经典的信号到符号屏障。深层和欺骗强调了物理现实与其抽象表示之间的联系,无论是由数字计算机学到的还是生物剂。从广泛适用的抽象概念定义开始,我们表明标准的前馈架构不能捕获琐碎的概念,尽管重量的数量和培训数据的量如何,尽管具有非常有效的分类器。另一方面,结合递归的架构可以代表更大的概念类别,但仍然无法从有限的数据集中学习它们。我们通过使用(自由能)Lagrangian来衡量信息复杂性的现代体系结构来描述可以“理解”的概念类别的概念类别。但是,即使已经理解了一个概念,除非通过持续的互动和验证,否则网络也没有将其理解传达给外部代理的方法。然后,我们将物理对象描述为抽象概念,并使用先前的分析表明物理对象可以由有限的体系结构编码。但是,要了解物理概念,传感器必须提供持久的令人兴奋的观察结果,控制数据采集过程的能力至关重要(主动感知)。控制的重要性取决于方式,而不是声学或化学感知,其视觉效果更大。最后,我们得出的结论是,在有限的资源中,在有限的时间内可以将物理实体绑定到数字身份,从原则上解决了信号到符号屏障问题,但我们强调了需要进行连续验证的必要性。
We revisit the classic signal-to-symbol barrier in light of the remarkable ability of deep neural networks to generate realistic synthetic data. DeepFakes and spoofing highlight the feebleness of the link between physical reality and its abstract representation, whether learned by a digital computer or a biological agent. Starting from a widely applicable definition of abstract concept, we show that standard feed-forward architectures cannot capture but trivial concepts, regardless of the number of weights and the amount of training data, despite being extremely effective classifiers. On the other hand, architectures that incorporate recursion can represent a significantly larger class of concepts, but may still be unable to learn them from a finite dataset. We qualitatively describe the class of concepts that can be "understood" by modern architectures trained with variants of stochastic gradient descent, using a (free energy) Lagrangian to measure information complexity. Even if a concept has been understood, however, a network has no means of communicating its understanding to an external agent, except through continuous interaction and validation. We then characterize physical objects as abstract concepts and use the previous analysis to show that physical objects can be encoded by finite architectures. However, to understand physical concepts, sensors must provide persistently exciting observations, for which the ability to control the data acquisition process is essential (active perception). The importance of control depends on the modality, benefiting visual more than acoustic or chemical perception. Finally, we conclude that binding physical entities to digital identities is possible in finite time with finite resources, solving in principle the signal-to-symbol barrier problem, but we highlight the need for continuous validation.