预训练带有遮掩自编码器的天气模型W-MAE用于多变量天气预测 (W-MAE: Pre-trained weather model with masked autoencoder for multi-variable weather forecasting) - 专知论文

会员服务 ·

0

掩码自编码MAE · 多变量 · StNet · 自编码器 · 预训练 ·

2023 年 4 月 18 日

W-MAE: Pre-trained weather model with masked autoencoder for multi-variable weather forecasting

翻译：预训练带有遮掩自编码器的天气模型W-MAE用于多变量天气预测

Xin Man,Chenghong Zhang,Changyu Li,Jie Shao

Weather forecasting is a long-standing computational challenge with direct societal and economic impacts. This task involves a large amount of continuous data collection and exhibits rich spatiotemporal dependencies over long periods, making it highly suitable for deep learning models. In this paper, we apply pre-training techniques to weather forecasting and propose W-MAE, a Weather model with Masked AutoEncoder pre-training for multi-variable weather forecasting. W-MAE is pre-trained in a self-supervised manner to reconstruct spatial correlations within meteorological variables. On the temporal scale, we fine-tune the pre-trained W-MAE to predict the future states of meteorological variables, thereby modeling the temporal dependencies present in weather data. We pre-train W-MAE using the fifth-generation ECMWF Reanalysis (ERA5) data, with samples selected every six hours and using only two years of data. Under the same training data conditions, we compare W-MAE with FourCastNet, and W-MAE outperforms FourCastNet in precipitation forecasting. In the setting where the training data is far less than that of FourCastNet, our model still performs much better in precipitation prediction (0.80 vs. 0.98). Additionally, experiments show that our model has a stable and significant advantage in short-to-medium-range forecasting (i.e., forecasting time ranges from 6 hours to one week), and the longer the prediction time, the more evident the performance advantage of W-MAE, further proving its robustness.

翻译：天气预报是一项长期存在的计算挑战，具有直接的社会和经济影响。这项任务涉及大量连续数据收集，并展现出较长时间的丰富的时空依赖性，使其非常适合深度学习模型。在本文中，我们将预训练技术应用于天气预测，并提出了W-MAE模型，这是一种带有遮蔽自编码器预训练的多变量天气预测模型。W-MAE以自监督的方式进行预训练，以重构气象变量内的空间相关性。在时间尺度上，我们微调预先训练的W-MAE以预测气象变量的未来状态，从而建模天气数据中存在的时间相关性。我们使用第五代ECMWF重分析（ERA5）数据对W-MAE进行预先训练，每隔六小时选择样本，并仅使用两年的数据。在与FourCastNet使用相同的训练数据条件下，我们比较了W-MAE和FourCastNet，并且W-MAE在降水预测方面胜过了FourCastNet。在训练数据远少于FourCastNet的情况下，我们的模型在降水预测方面仍然表现得更好（0.80与0.98）。此外，实验表明，我们的模型在短到中期的预测（即，预测时间范围从6小时到一周）中具有稳定和显着的优势，预测时间越长，W-MAE 的性能优势越明显，进一步证明了其鲁棒性。

0

相关内容

掩码自编码MAE

掩码自编码MAE

掩码自编码MAE

Transformers如何进行时序分析？Rowan大学最新《Transformers时序分析》综述

Transformers如何进行时序分析？Rowan大学最新《Transformers时序分析》综述

专知会员服务

86+阅读 · 2022年5月5日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

34+阅读 · 2022年3月5日

【AAAI2022】基于分层随机注意的Transformer 不确定性估计

【AAAI2022】基于分层随机注意的Transformer 不确定性估计

专知会员服务

29+阅读 · 2021年12月29日

【CMU博士论文】用动态超参数优化改进深度学习训练和推理，Improving Deep Learning Training and Inference with Dynamic Hyperparameter Optimization

【CMU博士论文】用动态超参数优化改进深度学习训练和推理，Improving Deep Learning Training and Inference with Dynamic Hyperparameter Optimization

专知会员服务

55+阅读 · 2020年5月26日

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

专知会员服务

174+阅读 · 2020年5月1日

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

专知会员服务

142+阅读 · 2020年4月30日

【Google】监督对比学习，Supervised Contrastive Learning

【Google】监督对比学习，Supervised Contrastive Learning

专知会员服务

75+阅读 · 2020年4月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

基于 Keras 用深度学习预测时间序列

基于 Keras 用深度学习预测时间序列

R语言中文社区

23+阅读 · 2018年7月27日

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

机器学习研究会

12+阅读 · 2018年1月27日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

基于分子进化的蛋白质共进化高维互信息模型

国家自然科学基金

4+阅读 · 2015年12月31日

云的可降水概率遥感分析及在气象干旱中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

复杂非线性过程潜在初始故障的监测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多源数据融合和非负不等式约束的全球电离层精细建模研究

国家自然科学基金

0+阅读 · 2013年12月31日

珠三角地区短期天气过程中大气气溶胶对云和降水的影响

国家自然科学基金

0+阅读 · 2012年12月31日

基于多源数据的电离层三维精细建模及震前电离层异常时空分布规律和触发机制探究

国家自然科学基金

0+阅读 · 2012年12月31日

大规模非平稳多元混沌时间序列分析与建模研究

国家自然科学基金

2+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

非平稳时间序列的非参数预测回归

国家自然科学基金

7+阅读 · 2012年12月31日

基于周期信息的时间序列缺失值填补方法研究

国家自然科学基金

1+阅读 · 2008年12月31日

Contrastive Shapelet Learning for Unsupervised Multivariate Time Series Representation Learning

Arxiv

0+阅读 · 2023年6月2日

Counting Crowds in Bad Weather

Arxiv

0+阅读 · 2023年6月2日

A Novel Driver Distraction Behavior Detection Based on Self-Supervised Learning Framework with Masked Image Modeling

Arxiv

0+阅读 · 2023年6月1日

Generalizable Memory-driven Transformer for Multivariate Long Sequence Time-series Forecasting

Arxiv

0+阅读 · 2023年5月31日

Forecasting Evolution of Clusters in Game Agents with Hebbian Learning

Arxiv

0+阅读 · 2023年5月31日

Multimodal Learning with Transformers: A Survey

Arxiv

69+阅读 · 2022年6月13日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Invariant Information Bottleneck for Domain Generalization

Invariant Information Bottleneck for Domain Generalization

Arxiv

15+阅读 · 2021年12月10日

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classication

Arxiv

17+阅读 · 2021年6月2日

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

Arxiv

13+阅读 · 2018年9月6日

VIP会员

文章信息

相关主题

掩码自编码MAE

相关VIP内容

Transformers如何进行时序分析？Rowan大学最新《Transformers时序分析》综述

Transformers如何进行时序分析？Rowan大学最新《Transformers时序分析》综述

专知会员服务

86+阅读 · 2022年5月5日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

34+阅读 · 2022年3月5日

【AAAI2022】基于分层随机注意的Transformer 不确定性估计

【AAAI2022】基于分层随机注意的Transformer 不确定性估计

专知会员服务

29+阅读 · 2021年12月29日

【CMU博士论文】用动态超参数优化改进深度学习训练和推理，Improving Deep Learning Training and Inference with Dynamic Hyperparameter Optimization

【CMU博士论文】用动态超参数优化改进深度学习训练和推理，Improving Deep Learning Training and Inference with Dynamic Hyperparameter Optimization

专知会员服务

55+阅读 · 2020年5月26日

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

专知会员服务

174+阅读 · 2020年5月1日

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

专知会员服务

142+阅读 · 2020年4月30日

【Google】监督对比学习，Supervised Contrastive Learning

【Google】监督对比学习，Supervised Contrastive Learning

专知会员服务

75+阅读 · 2020年4月24日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

【Amazon】使用预先训练的Transformer模型进行数据增强，Data Augmentation using Pre-trained Transformer Models

专知会员服务

51+阅读 · 2020年3月7日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

基于 Keras 用深度学习预测时间序列

基于 Keras 用深度学习预测时间序列

R语言中文社区

23+阅读 · 2018年7月27日

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

机器学习研究会

12+阅读 · 2018年1月27日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

【推荐】(Keras)LSTM多元时序预测教程

【推荐】(Keras)LSTM多元时序预测教程

机器学习研究会

24+阅读 · 2017年8月14日

相关论文

Contrastive Shapelet Learning for Unsupervised Multivariate Time Series Representation Learning

Arxiv

0+阅读 · 2023年6月2日

Counting Crowds in Bad Weather

Arxiv

0+阅读 · 2023年6月2日

A Novel Driver Distraction Behavior Detection Based on Self-Supervised Learning Framework with Masked Image Modeling

Arxiv

0+阅读 · 2023年6月1日

Generalizable Memory-driven Transformer for Multivariate Long Sequence Time-series Forecasting

Arxiv

0+阅读 · 2023年5月31日

Forecasting Evolution of Clusters in Game Agents with Hebbian Learning

Arxiv

0+阅读 · 2023年5月31日

Multimodal Learning with Transformers: A Survey

Arxiv

69+阅读 · 2022年6月13日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Invariant Information Bottleneck for Domain Generalization

Invariant Information Bottleneck for Domain Generalization

Arxiv

15+阅读 · 2021年12月10日

TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classication

Arxiv

17+阅读 · 2021年6月2日

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

A Memory-Network Based Solution for Multivariate Time-Series Forecasting

Arxiv

13+阅读 · 2018年9月6日

相关基金

基于分子进化的蛋白质共进化高维互信息模型

国家自然科学基金

4+阅读 · 2015年12月31日

云的可降水概率遥感分析及在气象干旱中的应用

国家自然科学基金

0+阅读 · 2013年12月31日

复杂非线性过程潜在初始故障的监测方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于多源数据融合和非负不等式约束的全球电离层精细建模研究

国家自然科学基金

0+阅读 · 2013年12月31日

珠三角地区短期天气过程中大气气溶胶对云和降水的影响

国家自然科学基金

0+阅读 · 2012年12月31日

基于多源数据的电离层三维精细建模及震前电离层异常时空分布规律和触发机制探究

国家自然科学基金

0+阅读 · 2012年12月31日

大规模非平稳多元混沌时间序列分析与建模研究

国家自然科学基金

2+阅读 · 2012年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

非平稳时间序列的非参数预测回归

国家自然科学基金

7+阅读 · 2012年12月31日

基于周期信息的时间序列缺失值填补方法研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员