遮盖自编码器作为图像处理器 (Masked Autoencoders as Image Processors)

Transformers have shown significant effectiveness for various vision tasks including both high-level vision and low-level vision. Recently, masked autoencoders (MAE) for feature pre-training have further unleashed the potential of Transformers, leading to state-of-the-art performances on various high-level vision tasks. However, the significance of MAE pre-training on low-level vision tasks has not been sufficiently explored. In this paper, we show that masked autoencoders are also scalable self-supervised learners for image processing tasks. We first present an efficient Transformer model considering both channel attention and shifted-window-based self-attention termed CSformer. Then we develop an effective MAE architecture for image processing (MAEIP) tasks. Extensive experimental results show that with the help of MAEIP pre-training, our proposed CSformer achieves state-of-the-art performance on various image processing tasks, including Gaussian denoising, real image denoising, single-image motion deblurring, defocus deblurring, and image deraining.

翻译：近年来，变压器在许多视觉任务中显示出了显著的有效性，包括高级别视觉和低级别视觉。最近，用于特征预训练的遮盖自编码器（MAE）进一步释放了变压器的潜力，从而在各种高级别视觉任务中实现了最先进的性能。但是，MAE预训练在低级别视觉任务中的重要性尚未得到充分探索。在本文中，我们展示了遮盖自编码器也是可扩展的自监督学习器，可用于图像处理任务。我们首先提出了一种高效的变压器模型，考虑通道注意力和基于移位窗口的自我注意力（ReZero机制）, 称为CSformer。然后，我们开发了一种有效的用于图像处理的遮盖自编码器架构（MAEIP）。广泛的实验结果表明，在MAEIP预训练的帮助下，我们提出的CSformer在各种图像处理任务中实现了最先进的性能，包括高斯去噪、实际图像去噪、单图像运动去模糊、虚焦去模糊和图像去雨。

相关内容

自编码器

关注 140

自动编码器是一种人工神经网络，用于以无监督的方式学习有效的数据编码。自动编码器的目的是通过训练网络忽略信号“噪声”来学习一组数据的表示（编码），通常用于降维。与简化方面一起，学习了重构方面，在此，自动编码器尝试从简化编码中生成尽可能接近其原始输入的表示形式，从而得到其名称。基本模型存在几种变体，其目的是迫使学习的输入表示形式具有有用的属性。自动编码器可有效地解决许多应用问题，从面部识别到获取单词的语义。

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

专知会员服务

33+阅读 · 2022年6月13日

最新《Transformers模型》教程，64页ppt

专知会员服务

320+阅读 · 2020年11月26日

【伯克利】自回归模型的局部掩卷积，Locally Masked Convolution for Autoregressive Models

专知会员服务

20+阅读 · 2020年6月23日

语言视觉预训练语言模型揭密，Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

专知会员服务

36+阅读 · 2020年5月20日