以动态方案拟订方法为基础的无限地平线问题数字近近似值的最佳最佳界限 (Optimal bounds for numerical approximations of infinite horizon problems based on dynamic programming approach) - 专知论文

会员服务 ·

0

离散化 · 价值函数 · CASE · 近似 · 泛函 ·

2021 年 11 月 14 日

Optimal bounds for numerical approximations of infinite horizon problems based on dynamic programming approach

翻译：以动态方案拟订方法为基础的无限地平线问题数字近近似值的最佳最佳界限

Javier de Frutos,Julia Novo

from arxiv, 14 pages

In this paper we get error bounds for fully discrete approximations of infinite horizon problems via the dynamic programming approach. It is well known that considering a time discretization with a positive step size $h$ an error bound of size $h$ can be proved for the difference between the value function (viscosity solution of the Hamilton-Jacobi-Bellman equation corresponding to the infinite horizon) and the value function of the discrete time problem. However, including also a spatial discretization based on elements of size $k$ an error bound of size $O(k/h)$ can be found in the literature for the error between the value functions of the continuous problem and the fully discrete problem. In this paper we revise the error bound of the fully discrete method and prove, under similar assumptions to those of the time discrete case, that the error of the fully discrete case is in fact $O(h+k)$ which gives first order in time and space for the method. This error bound matches the numerical experiments of many papers in the literature in which the behaviour $1/h$ from the bound $O(k/h)$ have not been observed.

翻译：在本文中,我们通过动态编程方法获得无限地平线问题完全离散近似值的错误界限;众所周知,如果考虑时间离散,且步骤大小为正数,则以美元为单位,则以美元为单位,则以无限地平线问题全离散的近似值为单位,如果价值函数(汉密尔顿-Jacobi-Bellman等方程式符合无限地平线)与离散时间问题的值函数(与远度相对应的汉密尔顿-Jacobi-Bellman等方程式的视觉溶解法)存在差错,则以美元(k/h)美元为单位,则以时间和空间为单位,包括空间离散的空间,则以美元(k/h)美元为单位,在文献中可以发现持续问题的价值函数与完全离散问题之间的差错。在本文件中,我们修订完全离散方法的误差,并在与时间离式假设的假设下证明,完全离散情况是美元(h)美元(k+k)美元(美元)的误差,使该方法在时间和空间上首次排序。这种误差与文献中的许多文件的数值实验与1美元/h/h)没有观察到。

0

相关内容

离散化

【AAAI2022】受限评委下双执行者的高效连续控制

【AAAI2022】受限评委下双执行者的高效连续控制

专知会员服务

17+阅读 · 2021年12月22日

【ICML2021】逆约束强化学习

专知会员服务

33+阅读 · 2021年9月7日

【经典书】线性代数，436页pdf

专知会员服务

77+阅读 · 2021年3月16日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【经典书】贝叶斯编程，378页pdf，Bayesian Programming

【经典书】贝叶斯编程，378页pdf，Bayesian Programming

专知会员服务

250+阅读 · 2020年5月18日

【经典书】C++解决问题第七版，1074pdf，Problem Solving with C++

【经典书】C++解决问题第七版，1074pdf，Problem Solving with C++

专知会员服务

77+阅读 · 2020年2月20日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

保序最优传输：Order-preserving Optimal Transport

保序最优传输：Order-preserving Optimal Transport

我爱读PAMI

6+阅读 · 2018年9月16日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

High order discontinuous cut finite element methods for linear hyperbolic conservation laws with an interface

Arxiv

0+阅读 · 2022年1月18日

Convergence of a robust deep FBSDE method for stochastic control

Arxiv

0+阅读 · 2022年1月18日

Least squares estimators based on the Adams method for stochastic differential equations with small Lévy noise

Arxiv

0+阅读 · 2022年1月18日

Limits and consistency of non-local and graph approximations to the Eikonal equation

Arxiv

0+阅读 · 2022年1月17日

Parametrized Convex Universal Approximators for Decision-Making Problems

Arxiv

0+阅读 · 2022年1月17日

Unconditionally optimal error estimate of a linearized variable-time-step BDF2 scheme for nonlinear parabolic equations

Arxiv

0+阅读 · 2022年1月16日

Lower bounds on the performance of online algorithms for relaxed packing problems

Arxiv

0+阅读 · 2022年1月16日

Eikonal depth: an optimal control approach to statistical depths

Arxiv

0+阅读 · 2022年1月14日

Consistent Approximations in Composite Optimization

Arxiv

0+阅读 · 2022年1月13日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

VIP会员

文章信息

相关主题

相关VIP内容

【AAAI2022】受限评委下双执行者的高效连续控制

【AAAI2022】受限评委下双执行者的高效连续控制

专知会员服务

17+阅读 · 2021年12月22日

【ICML2021】逆约束强化学习

专知会员服务

33+阅读 · 2021年9月7日

【经典书】线性代数，436页pdf

专知会员服务

77+阅读 · 2021年3月16日

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

(普林斯顿讲义)：高维概率论，326页pdf《Probability in High Dimension》

专知会员服务

122+阅读 · 2020年5月30日

【经典书】贝叶斯编程，378页pdf，Bayesian Programming

【经典书】贝叶斯编程，378页pdf，Bayesian Programming

专知会员服务

250+阅读 · 2020年5月18日

【经典书】C++解决问题第七版，1074pdf，Problem Solving with C++

【经典书】C++解决问题第七版，1074pdf，Problem Solving with C++

专知会员服务

77+阅读 · 2020年2月20日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML2025】用于持续多模态指令微调的动态课程化LoRA专家混合机制

生成模型中持续学习的综合综述

【斯坦福博士论文】通过以人为本的自然语言界面拓展 AI 的可及性

【新书】《LangChain生成式AI实战：使用 Python 与 LangGraph 构建大语言模型应用与高级智能体》

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

spinningup.openai 强化学习资源完整

spinningup.openai 强化学习资源完整

CreateAMind

6+阅读 · 2018年12月17日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

保序最优传输：Order-preserving Optimal Transport

保序最优传输：Order-preserving Optimal Transport

我爱读PAMI

6+阅读 · 2018年9月16日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

High order discontinuous cut finite element methods for linear hyperbolic conservation laws with an interface

Arxiv

0+阅读 · 2022年1月18日

Convergence of a robust deep FBSDE method for stochastic control

Arxiv

0+阅读 · 2022年1月18日

Least squares estimators based on the Adams method for stochastic differential equations with small Lévy noise

Arxiv

0+阅读 · 2022年1月18日

Limits and consistency of non-local and graph approximations to the Eikonal equation

Arxiv

0+阅读 · 2022年1月17日

Parametrized Convex Universal Approximators for Decision-Making Problems

Arxiv

0+阅读 · 2022年1月17日

Unconditionally optimal error estimate of a linearized variable-time-step BDF2 scheme for nonlinear parabolic equations

Arxiv

0+阅读 · 2022年1月16日

Lower bounds on the performance of online algorithms for relaxed packing problems

Arxiv

0+阅读 · 2022年1月16日

Eikonal depth: an optimal control approach to statistical depths

Arxiv

0+阅读 · 2022年1月14日

Consistent Approximations in Composite Optimization

Arxiv

0+阅读 · 2022年1月13日

Variational Bayesian Reinforcement Learning with Regret Bounds

Arxiv

3+阅读 · 2018年7月25日

微信扫码咨询专知VIP会员