深度梯度流方法求解偏微分方程时泛化误差的收敛性 (Convergence of the generalization error for deep gradient flow methods for PDEs) - 专知论文

会员服务 ·

0

梯度 · 泛化 · 泛化误差 · 近似 · 近似误差 ·

2025 年 12 月 31 日

Convergence of the generalization error for deep gradient flow methods for PDEs

翻译：深度梯度流方法求解偏微分方程时泛化误差的收敛性

Chenguang Liu,Antonis Papapantoleon,Jasper Rou

from arxiv, 28 pages

The aim of this article is to provide a firm mathematical foundation for the application of deep gradient flow methods (DGFMs) for the solution of (high-dimensional) partial differential equations (PDEs). We decompose the generalization error of DGFMs into an approximation and a training error. We first show that the solution of PDEs that satisfy reasonable and verifiable assumptions can be approximated by neural networks, thus the approximation error tends to zero as the number of neurons tends to infinity. Then, we derive the gradient flow that the training process follows in the ``wide network limit'' and analyze the limit of this flow as the training time tends to infinity. These results combined show that the generalization error of DGFMs tends to zero as the number of neurons and the training time tend to infinity.

翻译：本文旨在为深度梯度流方法（DGFMs）求解（高维）偏微分方程（PDEs）提供坚实的数学基础。我们将DGFMs的泛化误差分解为近似误差和训练误差。首先证明，满足合理且可验证假设的偏微分方程解可由神经网络逼近，因此当神经元数量趋于无穷时，近似误差趋于零。随后，推导出训练过程在“宽网络极限”下遵循的梯度流，并分析该流在训练时间趋于无穷时的极限。综合这些结果表明，当神经元数量和训练时间均趋于无穷时，DGFMs的泛化误差趋于零。

0

相关内容

梯度的本意是一个向量（矢量），表示某一函数在该点处的方向导数沿着该方向取得最大值，即函数在该点处沿着该方向（此梯度的方向）变化最快，变化率最大（为该梯度的模）。

深度线性神经网络的梯度流方程：一项基于网络视角的综述

深度线性神经网络的梯度流方程：一项基于网络视角的综述

专知会员服务

8+阅读 · 2025年11月14日

【ICML2024】基于正则化的持续学习的统计理论

【ICML2024】基于正则化的持续学习的统计理论

专知会员服务

21+阅读 · 2024年6月11日

【ICML2021】基于低秩重参数化的大规模私有学习

专知会员服务

12+阅读 · 2021年6月20日

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

专知会员服务

18+阅读 · 2021年5月3日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

专知

12+阅读 · 2020年6月9日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

从泰勒展开来看梯度下降算法

从泰勒展开来看梯度下降算法

深度学习每日摘要

13+阅读 · 2019年4月9日

详解常见的损失函数

详解常见的损失函数

七月在线实验室

20+阅读 · 2018年7月12日

MNIST入门：贝叶斯方法

MNIST入门：贝叶斯方法

Python程序员

23+阅读 · 2017年7月3日

光滑函数类的熵数估计

国家自然科学基金

0+阅读 · 2015年12月31日

随机系数和带跳的线性随机微分系统的H2/H∞控制

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

解析函数空间上的Toeplitz型奇异积分算子

国家自然科学基金

0+阅读 · 2014年12月31日

弹性应变梯度问题的有限元方法

国家自然科学基金

0+阅读 · 2014年12月31日

Concentration Inequalities for Stochastic Optimization of Unbounded Objective Functions with Application to Denoising Score Matching

Arxiv

0+阅读 · 2025年12月31日

Stochastic Gradient Descent for Nonparametric Additive Regression

Arxiv

0+阅读 · 2025年12月30日

Fundamental limits for weighted empirical approximations of tilted distributions

Arxiv

0+阅读 · 2025年12月30日

Optimal estimation for regression discontinuity design with binary outcomes

Arxiv

0+阅读 · 2025年12月26日

Total Normal Curvature Regularization and its Minimization for Surface and Image Smoothing

Arxiv

0+阅读 · 2025年12月25日

VIP会员

文章信息

相关主题

相关VIP内容

深度线性神经网络的梯度流方程：一项基于网络视角的综述

深度线性神经网络的梯度流方程：一项基于网络视角的综述

专知会员服务

8+阅读 · 2025年11月14日

【ICML2024】基于正则化的持续学习的统计理论

【ICML2024】基于正则化的持续学习的统计理论

专知会员服务

21+阅读 · 2024年6月11日

【ICML2021】基于低秩重参数化的大规模私有学习

专知会员服务

12+阅读 · 2021年6月20日

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

MonoGRNet：单目3D目标检测的通用框架（TPAMI2021）

专知会员服务

18+阅读 · 2021年5月3日

图机器学习-图拉普拉斯算子的离散正则性，141页ppt，Discrete regularity graph Laplacians

专知会员服务

29+阅读 · 2020年6月4日

热门VIP内容

开通专知VIP会员享更多权益服务

生成式人工智能导论：可靠性、负责任开发及实际应用（第二版）

《2025财年美陆军转型倡议（ATI）部队结构与组织提案》

【CMU博士论文】分布偏移下的可信机器学习

智能体 EDA 的曙光：自主数字芯片设计综述

相关资讯

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

专知

12+阅读 · 2020年6月9日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

从泰勒展开来看梯度下降算法

从泰勒展开来看梯度下降算法

深度学习每日摘要

13+阅读 · 2019年4月9日

详解常见的损失函数

详解常见的损失函数

七月在线实验室

20+阅读 · 2018年7月12日

MNIST入门：贝叶斯方法

MNIST入门：贝叶斯方法

Python程序员

23+阅读 · 2017年7月3日

相关论文

Concentration Inequalities for Stochastic Optimization of Unbounded Objective Functions with Application to Denoising Score Matching

Arxiv

0+阅读 · 2025年12月31日

Stochastic Gradient Descent for Nonparametric Additive Regression

Arxiv

0+阅读 · 2025年12月30日

Fundamental limits for weighted empirical approximations of tilted distributions

Arxiv

0+阅读 · 2025年12月30日

Optimal estimation for regression discontinuity design with binary outcomes

Arxiv

0+阅读 · 2025年12月26日

Total Normal Curvature Regularization and its Minimization for Surface and Image Smoothing

Arxiv

0+阅读 · 2025年12月25日

相关基金

光滑函数类的熵数估计

国家自然科学基金

0+阅读 · 2015年12月31日

随机系数和带跳的线性随机微分系统的H2/H∞控制

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

解析函数空间上的Toeplitz型奇异积分算子

国家自然科学基金

0+阅读 · 2014年12月31日

弹性应变梯度问题的有限元方法

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员