CIE XYZ NET：低级计算机视觉任务的未加工图像

论文标题

CIE XYZ NET：低级计算机视觉任务的未加工图像

CIE XYZ Net: Unprocessing Images for Low-Level Computer Vision Tasks

论文作者

Afifi, Mahmoud, Abdelhamed, Abdelrahman, Abuolaim, Abdullah, Punnappurath, Abhijith, Brown, Michael S.

论文摘要

相机当前允许访问两个图像状态：（i）最小处理的线性RAW-RGB图像状态（即原始传感器数据）或（ii）高度处理的非线性图像状态（例如SRGB）。有许多计算机视觉任务与线性图像状态最有效，例如图像脱张和图像脱壳。不幸的是，绝大多数图像保存在非线性图像状态中。因此，已经提出了许多方法将非线性图像“未加入”到RAW-RGB状态。但是，现有的未加工方法具有缺点，因为RAW-RGB图像是特定于传感器的。结果，有必要知道哪个相机生产了SRGB输出，并使用为该传感器量身定制的方法或网络来正确处理它。本文通过利用另一个无法作为输出的相机图像状态来解决此限制，但可以在相机管道内使用。特别是，相机应用比色转换步骤，将RAW-RGB图像转换为基于CIE XYZ颜色空间的设备独立的空间，然后才能应用非线性照相。利用这种规范的图像状态，我们提出了一个深度学习框架，即CIE XYZ NET，可以将非线性图像毫无程序化回到规范的CIE XYZ图像。然后，该图像可以由任何低级计算机视觉运算符处理，并重新渲染回非线性图像。我们证明了CIE XYZ NET对几个低级视觉任务的有用性，并显示了通过此处理框架获得的显着收益。代码和数据集可在https://github.com/mahmoudnafifi/cie_xyz_net上公开获取。

Cameras currently allow access to two image states: (i) a minimally processed linear raw-RGB image state (i.e., raw sensor data) or (ii) a highly-processed nonlinear image state (e.g., sRGB). There are many computer vision tasks that work best with a linear image state, such as image deblurring and image dehazing. Unfortunately, the vast majority of images are saved in the nonlinear image state. Because of this, a number of methods have been proposed to "unprocess" nonlinear images back to a raw-RGB state. However, existing unprocessing methods have a drawback because raw-RGB images are sensor-specific. As a result, it is necessary to know which camera produced the sRGB output and use a method or network tailored for that sensor to properly unprocess it. This paper addresses this limitation by exploiting another camera image state that is not available as an output, but it is available inside the camera pipeline. In particular, cameras apply a colorimetric conversion step to convert the raw-RGB image to a device-independent space based on the CIE XYZ color space before they apply the nonlinear photo-finishing. Leveraging this canonical image state, we propose a deep learning framework, CIE XYZ Net, that can unprocess a nonlinear image back to the canonical CIE XYZ image. This image can then be processed by any low-level computer vision operator and re-rendered back to the nonlinear image. We demonstrate the usefulness of the CIE XYZ Net on several low-level vision tasks and show significant gains that can be obtained by this processing framework. Code and dataset are publicly available at https://github.com/mahmoudnafifi/CIE_XYZ_NET.

下载PDF全文

下载文献需遵守相关版权规定

论文标题