论文标题
注意力的焦点改善了视觉特征的信息传递
Focus of Attention Improves Information Transfer in Visual Features
论文作者
论文摘要
从连续的视觉流进行无监督的学习是一个具有挑战性的问题,在经典的计算设置中无法自然和有效地管理。信息流必须相应地处理视觉数据的适当时空分布,而大多数学习方法通常假设概率密度均匀。在本文中,我们专注于无监督的学习,用于在真正的在线环境中传输视觉信息,该计算模型受到启发,该模型受到物理学最少的原则的启发。相互信息的最大化是通过时间过程进行的,该时间过程得出熵项的在线估计。该模型基于二阶微分方程,最大化信息从输入到与输入的视觉特征相关的符号的离散空间,其计算受隐藏的神经元支持。为了更好地构建输入概率分布,我们使用类似人类的注意模型的焦点,该模型与信息最大化模型一致,也基于二阶微分方程。我们提供实验结果来支持该理论,以表明注意力的重点引起的时空滤波使该系统能够在集中区域上从输入流中传输更多信息,并且在某些情况下,在整个框架上,就未过滤的情况而言,在整个框架上都可以产生统一的概率分布。
Unsupervised learning from continuous visual streams is a challenging problem that cannot be naturally and efficiently managed in the classic batch-mode setting of computation. The information stream must be carefully processed accordingly to an appropriate spatio-temporal distribution of the visual data, while most approaches of learning commonly assume uniform probability density. In this paper we focus on unsupervised learning for transferring visual information in a truly online setting by using a computational model that is inspired to the principle of least action in physics. The maximization of the mutual information is carried out by a temporal process which yields online estimation of the entropy terms. The model, which is based on second-order differential equations, maximizes the information transfer from the input to a discrete space of symbols related to the visual features of the input, whose computation is supported by hidden neurons. In order to better structure the input probability distribution, we use a human-like focus of attention model that, coherently with the information maximization model, is also based on second-order differential equations. We provide experimental results to support the theory by showing that the spatio-temporal filtering induced by the focus of attention allows the system to globally transfer more information from the input stream over the focused areas and, in some contexts, over the whole frames with respect to the unfiltered case that yields uniform probability distributions.