论文标题
一项研究,机器的端到端优化图像压缩
End-to-end optimized image compression for machines, a study
论文作者
论文摘要
图像和视频内容的份额越来越多,由机器分析,而不是人类观察到,因此,在远程进行分析的此类应用程序中优化编解码器是相关的。不幸的是,传统的编码工具最初是为人类感知而设计的,因此具有挑战性地专门研究机器任务。但是,基于神经网络的编解码器可以与基于任何卷积神经网络(CNN)的任务模型共同训练端到端。在本文中,我们建议使用由压缩模块组成的链和一个可以优化端到端的任务算法组成的链条,从而为远程机器任务分析提供有效的图像压缩。我们表明,当对编解码器和任务网络进行微调时,尤其是在低比特率下,可以显着提高任务准确性。根据培训或部署的约束,只能在编码器,解码器或任务网络上应用选择性微调,并且仍然可以在现成的编解码器和任务网络上实现速率准确性的改进。我们的结果还证明了端到端管道用于实际应用的灵活性。
An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately, conventional coding tools are challenging to specialize for machine tasks as they were originally designed for human perception. However, neural network based codecs can be jointly trained end-to-end with any convolutional neural network (CNN)-based task model. In this paper, we propose to study an end-to-end framework enabling efficient image compression for remote machine task analysis, using a chain composed of a compression module and a task algorithm that can be optimized end-to-end. We show that it is possible to significantly improve the task accuracy when fine-tuning jointly the codec and the task networks, especially at low bit-rates. Depending on training or deployment constraints, selective fine-tuning can be applied only on the encoder, decoder or task network and still achieve rate-accuracy improvements over an off-the-shelf codec and task network. Our results also demonstrate the flexibility of end-to-end pipelines for practical applications.