多焦点图像融合渐渐了解层叠加网络 (Gradient Aware Cascade Network for Multi-Focus Image Fusion)

The general aim of multi-focus image fusion is to gather focused regions of different images to generate a unique all-in-focus fused image. Deep learning based methods become the mainstream of image fusion by virtue of its powerful feature representation ability. However, most of the existing deep learning structures failed to balance fusion quality and end-to-end implementation convenience. End-to-end decoder design often leads to poor performance because of its non-linear mapping mechanism. On the other hand, generating an intermediate decision map achieves better quality for the fused image, but relies on the rectification with empirical post-processing parameter choices. In this work, to handle the requirements of both output image quality and comprehensive simplicity of structure implementation, we propose a cascade network to simultaneously generate decision map and fused result with an end-to-end training procedure. It avoids the dependence on empirical post-processing methods in the inference stage. To improve the fusion quality, we introduce a gradient aware loss function to preserve gradient information in output fused image. In addition, we design a decision calibration strategy to decrease the time consumption in the application of multiple image fusion. Extensive experiments are conducted to compare with 18 different state-of-the-art multi-focus image fusion structures with 6 assessment metrics. The results prove that our designed structure can generally ameliorate the output fused image quality, while implementation efficiency increases over 30\% for multiple image fusion.

翻译：多焦点图像融合的总目标是收集不同图像的焦点区域,以产生独特的全焦点组合图像。深层次学习方法由于其强大的特征代表能力而成为图像融合的主流。然而,现有的大部分深层次学习结构未能平衡融合质量和端到端执行便利。端到端的解码器设计往往因其非线性绘图机制而导致性能不佳。另一方面,制作中间决定图能提高引信图像的质量,但依赖经验性后处理参数选择的校正。在这项工作中,我们建议建立一个连锁网络,同时生成决定图,并将结果与端到端执行程序相结合。在推论阶段,最终端到端的解码设计设计往往导致工作业绩不佳。为了提高聚合质量,我们引入了梯度意识损失功能,以保存导出图像的精度信息质量,但依赖经验性后处理参数选择的校正。此外,我们设计了一个决定校正战略,以降低应用多图像质量质量和结构的时的消耗量,我们提议一个连锁的连锁网络,同时将结果结合一个端到端到端到端端点的模结构。全面实验可以进行试验,以测试到宽度的比重的30级的图像结构。