一个统一的端到端框架，用于有效的深层图像压缩

论文标题

一个统一的端到端框架，用于有效的深层图像压缩

A Unified End-to-End Framework for Efficient Deep Image Compression

论文作者

Liu, Jiaheng, Lu, Guo, Hu, Zhihao, Xu, Dong

论文摘要

图像压缩是一种广泛使用的技术，可降低图像中的空间冗余。最近，基于学习的图像压缩通过使用神经网络的强大表示能力取得了重大进展。但是，当前基于最新学习的图像压缩方法遭受了巨大的计算成本，这限制了其实用应用的能力。在本文中，我们提出了一个基于三种新技术的统一框架，称为有效的深层图像压缩（EDIC），包括通道注意模块，高斯混合模型和解码器端增强模块。具体来说，我们设计了一个自动编码器样式网络，用于基于学习的图像压缩。为了提高编码效率，我们使用通道注意模块利用潜在表示之间的通道关系。此外，为熵模型引入了高斯混合模型，并提高了比特率估计的准确性。此外，我们介绍了解码器侧增强模块，以进一步提高图像压缩性能。我们的EDIC方法也可以很容易地与深视频压缩（DVC）框架合并，以进一步改善视频压缩性能。同时，我们的EDIC方法显着提高了编码性能，同时带来了略有增加的计算成本。更重要的是，实验结果表明，所提出的方法的表现优于当前的最新图像压缩方法，并且与Minnen的方法相比，解码速度的速度快150倍以上。提出的框架还成功地改善了最近的深视频压缩系统DVC的性能。我们的代码将在https://github.com/liujiaheng/compression上发布。

Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current state-of-the-art learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current state-of-the-art image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method. The proposed framework also successfully improves the performance of the recent deep video compression system DVC. Our code will be released at https://github.com/liujiaheng/compression.

下载PDF全文

下载文献需遵守相关版权规定

论文标题