论文标题
FIELFORSER:基于环切片的变压器用于鱼眼矫正和功效域探索
FishFormer: Annulus Slicing-based Transformer for Fisheye Rectification with Efficacy Domain Exploration
论文作者
论文摘要
通过CNN取得了许多关于鱼眼图像纠正的重大进展。然而,受到固定的接受场的约束,失真的全局分布和局部对称性尚未得到充分利用。为了利用这两个特征,我们引入了将鱼眼图像作为增强全球和局部感知的序列进行处理的鱼眼。我们根据鱼眼图像的结构特性对变压器进行了调整。首先,现有的正方形切片方法生成的贴片中的不均匀变形分布使网络混淆,从而导致了艰难的训练。因此,我们提出了一种环形切片方法,以维持每个斑块中失真的一致性,从而很好地感知了失真分布。其次,我们分析不同的失真参数具有自己的疗效域。因此,对本地区域的看法与全球一样重要,但是变压器对于局部纹理感知有弱点。因此,我们提出了一种新的层注意机制,以增强局部感知和纹理转移。我们的网络同时实现了全球感知,并以不同参数决定的本地感知重点。广泛的实验表明,与最先进的方法相比,我们的方法提供了卓越的性能。
Numerous significant progress on fisheye image rectification has been achieved through CNN. Nevertheless, constrained by a fixed receptive field, the global distribution and the local symmetry of the distortion have not been fully exploited. To leverage these two characteristics, we introduced Fishformer that processes the fisheye image as a sequence to enhance global and local perception. We tuned the Transformer according to the structural properties of fisheye images. First, the uneven distortion distribution in patches generated by the existing square slicing method confuses the network, resulting in difficult training. Therefore, we propose an annulus slicing method to maintain the consistency of the distortion in each patch, thus perceiving the distortion distribution well. Second, we analyze that different distortion parameters have their own efficacy domains. Hence, the perception of the local area is as important as the global, but Transformer has a weakness for local texture perception. Therefore, we propose a novel layer attention mechanism to enhance the local perception and texture transfer. Our network simultaneously implements global perception and focused local perception decided by the different parameters. Extensive experiments demonstrate that our method provides superior performance compared with state-of-the-art methods.