RR-DNCNN v2.0：增强基于下采样视频编码的增强恢复重建深神经网络

论文标题

RR-DNCNN v2.0：增强基于下采样视频编码的增强恢复重建深神经网络

RR-DnCNN v2.0: Enhanced Restoration-Reconstruction Deep Neural Network for Down-Sampling Based Video Coding

论文作者

Ho, Man M., Zhou, Jinjia, He, Gang

论文摘要

与标准压缩技术相比，将深度学习技术集成到视频编码框架中，尤其是将超分辨率（上采样）应用于基于下采样的视频编码作为后处理。但是，除了提高采样降解外，由压缩带来的各种文物使超分辨率问题更加难以解决。直接的解决方案是在超分辨率之前整合伪像的去除技术。但是，可以将一些有用的功能一起删除，从而降低超分辨率性能。为了解决这个问题，我们提出了一种使用降解感知技术的端到端恢复重建深神经网络（RR-DNCNN），该技术完全解决了压缩和子采样的降解。此外，我们证明，随机访问配置产生的压缩降解足以覆盖其他降解类型，例如低延迟P和所有内部培训。由于以许多层为链的直接网络RR-DNCNN具有较差的学习能力，因此遇到了梯度消失的问题，因此我们重新设计了网络体系结构，以使重建利用使用向上采样跳过连接而从修复中捕获的功能。我们的新颖架构称为Restoration-Reconstruction U形深神经网络（RR-DNCNN v2.0）。结果，我们的RR-DNCNN v2.0胜过以前的作品，可以在标准H.265/HEVC锚定的全intra的UHD分辨率上减少17.02％的BD率。源代码可在https://minhmanho.github.io/rrdncnn/上获得。

Integrating deep learning techniques into the video coding framework gains significant improvement compared to the standard compression techniques, especially applying super-resolution (up-sampling) to down-sampling based video coding as post-processing. However, besides up-sampling degradation, the various artifacts brought from compression make super-resolution problem more difficult to solve. The straightforward solution is to integrate the artifact removal techniques before super-resolution. However, some helpful features may be removed together, degrading the super-resolution performance. To address this problem, we proposed an end-to-end restoration-reconstruction deep neural network (RR-DnCNN) using the degradation-aware technique, which entirely solves degradation from compression and sub-sampling. Besides, we proved that the compression degradation produced by Random Access configuration is rich enough to cover other degradation types, such as Low Delay P and All Intra, for training. Since the straightforward network RR-DnCNN with many layers as a chain has poor learning capability suffering from the gradient vanishing problem, we redesign the network architecture to let reconstruction leverages the captured features from restoration using up-sampling skip connections. Our novel architecture is called restoration-reconstruction u-shaped deep neural network (RR-DnCNN v2.0). As a result, our RR-DnCNN v2.0 outperforms the previous works and can attain 17.02% BD-rate reduction on UHD resolution for all-intra anchored by the standard H.265/HEVC. The source code is available at https://minhmanho.github.io/rrdncnn/.

下载PDF全文

下载文献需遵守相关版权规定

论文标题