论文标题

为什么卷积网络学习定向带通滤波器:理论和经验支持

Why Convolutional Networks Learn Oriented Bandpass Filters: Theory and Empirical Support

论文作者

Hadji, Isma, Wildes, Richard P.

论文摘要

已经反复观察到,将卷积体系结构应用于图像理解任务时学习面向带通滤波器。对此结果的标准说明是,这些过滤器反映了训练过程中暴露于图像的结构:自然图像通常是由各种尺度的定向轮廓组成的,并且与该结构相匹配。我们提供了一种基于图像结构,而是基于卷积体系结构的结构提供的替代解释。特别是,复杂的指数是卷积的特征函数。这些本征函数是在全球定义的。但是,卷积体系结构在本地运作。为了强制执行局部性,可以将窗口功能应用于本征函数,从而导致面向带的带通滤波器作为自然操作员使用卷积架构学习。从代表性的角度来看,这些过滤器允许在图像或其他信号上表征和操作的局部系统方法。我们为卷积网络在所有卷积层学习此类过滤器的假设提供了经验支持。虽然先前的研究表明,早期层的过滤器具有定向的带通特性的证据,但我们的研究似乎是第一个记录所有层过滤特征占主导地位的研究。先前的研究错过了这一观察结果,因为它们集中在跨层过滤的累积组成效应上,同时我们检查了每一层中存在的滤波器特性。

It has been repeatedly observed that convolutional architectures when applied to image understanding tasks learn oriented bandpass filters. A standard explanation of this result is that these filters reflect the structure of the images that they have been exposed to during training: Natural images typically are locally composed of oriented contours at various scales and oriented bandpass filters are matched to such structure. We offer an alternative explanation based not on the structure of images, but rather on the structure of convolutional architectures. In particular, complex exponentials are the eigenfunctions of convolution. These eigenfunctions are defined globally; however, convolutional architectures operate locally. To enforce locality, one can apply a windowing function to the eigenfunctions, which leads to oriented bandpass filters as the natural operators to be learned with convolutional architectures. From a representational point of view, these filters allow for a local systematic way to characterize and operate on an image or other signal. We offer empirical support for the hypothesis that convolutional networks learn such filters at all of their convolutional layers. While previous research has shown evidence of filters having oriented bandpass characteristics at early layers, ours appears to be the first study to document the predominance of such filter characteristics at all layers. Previous studies have missed this observation because they have concentrated on the cumulative compositional effects of filtering across layers, while we examine the filter characteristics that are present at each layer.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源