实时显着性预测的紧凑型深度体系结构

论文标题

实时显着性预测的紧凑型深度体系结构

A Compact Deep Architecture for Real-time Saliency Prediction

论文作者

Zabihi, Saman, Tavakoli, Hamed Rezazadegan, Borji, Ali

论文摘要

显着计算模型旨在模仿人类视觉系统中的注意力机制。在过去的几年中，深度神经网络在显着性预测中的应用导致了急剧改善。但是，深层模型具有大量参数，这使其不适合实时应用。在这里，我们为实时显着性预测提供了一个紧凑而快速的模型。我们提出的模型由修改后的U-NET体系结构，新颖的完全连接的层和中心差卷积层组成。修改后的U-NET体系结构促进了紧凑性和效率。新颖的完全连接的层有助于依赖位置的信息的隐式捕获。在不同尺度上使用中心差卷积层可捕获更健壮和具有生物学动机的特征。我们使用传统显着性分数以及新设计的计划将模型与艺术显着性模型进行比较。四个挑战性基准数据集的实验结果证明了我们方法在准确性和速度之间取得平衡的有效性。我们的模型可以实时运行，这使其对边缘设备和视频处理具有吸引力。

Saliency computation models aim to imitate the attention mechanism in the human visual system. The application of deep neural networks for saliency prediction has led to a drastic improvement over the last few years. However, deep models have a high number of parameters which makes them less suitable for real-time applications. Here we propose a compact yet fast model for real-time saliency prediction. Our proposed model consists of a modified U-net architecture, a novel fully connected layer, and central difference convolutional layers. The modified U-Net architecture promotes compactness and efficiency. The novel fully-connected layer facilitates the implicit capturing of the location-dependent information. Using the central difference convolutional layers at different scales enables capturing more robust and biologically motivated features. We compare our model with state of the art saliency models using traditional saliency scores as well as our newly devised scheme. Experimental results over four challenging saliency benchmark datasets demonstrate the effectiveness of our approach in striking a balance between accuracy and speed. Our model can be run in real-time which makes it appealing for edge devices and video processing.

下载PDF全文

下载文献需遵守相关版权规定

论文标题