论文标题
RR-DNCNN v2.0:增强基于下采样视频编码的增强恢复重建深神经网络
RR-DnCNN v2.0: Enhanced Restoration-Reconstruction Deep Neural Network for Down-Sampling Based Video Coding
论文作者
论文摘要
与标准压缩技术相比,将深度学习技术集成到视频编码框架中,尤其是将超分辨率(上采样)应用于基于下采样的视频编码作为后处理。但是,除了提高采样降解外,由压缩带来的各种文物使超分辨率问题更加难以解决。直接的解决方案是在超分辨率之前整合伪像的去除技术。但是,可以将一些有用的功能一起删除,从而降低超分辨率性能。为了解决这个问题,我们提出了一种使用降解感知技术的端到端恢复重建深神经网络(RR-DNCNN),该技术完全解决了压缩和子采样的降解。此外,我们证明,随机访问配置产生的压缩降解足以覆盖其他降解类型,例如低延迟P和所有内部培训。由于以许多层为链的直接网络RR-DNCNN具有较差的学习能力,因此遇到了梯度消失的问题,因此我们重新设计了网络体系结构,以使重建利用使用向上采样跳过连接而从修复中捕获的功能。我们的新颖架构称为Restoration-Reconstruction U形深神经网络(RR-DNCNN v2.0)。结果,我们的RR-DNCNN v2.0胜过以前的作品,可以在标准H.265/HEVC锚定的全intra的UHD分辨率上减少17.02%的BD率。源代码可在https://minhmanho.github.io/rrdncnn/上获得。
Integrating deep learning techniques into the video coding framework gains significant improvement compared to the standard compression techniques, especially applying super-resolution (up-sampling) to down-sampling based video coding as post-processing. However, besides up-sampling degradation, the various artifacts brought from compression make super-resolution problem more difficult to solve. The straightforward solution is to integrate the artifact removal techniques before super-resolution. However, some helpful features may be removed together, degrading the super-resolution performance. To address this problem, we proposed an end-to-end restoration-reconstruction deep neural network (RR-DnCNN) using the degradation-aware technique, which entirely solves degradation from compression and sub-sampling. Besides, we proved that the compression degradation produced by Random Access configuration is rich enough to cover other degradation types, such as Low Delay P and All Intra, for training. Since the straightforward network RR-DnCNN with many layers as a chain has poor learning capability suffering from the gradient vanishing problem, we redesign the network architecture to let reconstruction leverages the captured features from restoration using up-sampling skip connections. Our novel architecture is called restoration-reconstruction u-shaped deep neural network (RR-DnCNN v2.0). As a result, our RR-DnCNN v2.0 outperforms the previous works and can attain 17.02% BD-rate reduction on UHD resolution for all-intra anchored by the standard H.265/HEVC. The source code is available at https://minhmanho.github.io/rrdncnn/.