越线RL:样本高效神经功能近似 (Going Beyond Linear RL: Sample Efficient Neural Function Approximation) - 专知论文

会员服务 ·

0

近似 · 泛函 · 线性的 · Neural Networks · Q函数 ·

2021 年 7 月 14 日

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

翻译：越线RL:样本高效神经功能近似

Baihe Huang,Kaixuan Huang,Sham M. Kakade,Jason D. Lee,Qi Lei,Runzhe Wang,Jiaqi Yang

Deep Reinforcement Learning (RL) powered by neural net approximation of the Q function has had enormous empirical success. While the theory of RL has traditionally focused on linear function approximation (or eluder dimension) approaches, little is known about nonlinear RL with neural net approximations of the Q functions. This is the focus of this work, where we study function approximation with two-layer neural networks (considering both ReLU and polynomial activation functions). Our first result is a computationally and statistically efficient algorithm in the generative model setting under completeness for two-layer neural networks. Our second result considers this setting but under only realizability of the neural net function class. Here, assuming deterministic dynamics, the sample complexity scales linearly in the algebraic dimension. In all cases, our results significantly improve upon what can be attained with linear (or eluder dimension) methods.

翻译：以Q函数神经网近似为动力的深度强化学习(RL)获得了巨大的经验成功。虽然RL理论传统上侧重于线性函数近似(或快率维度)方法,但对于Q函数神经网近似的非线性RL却鲜为人知。这就是这项工作的重点,我们在这里研究与两层神经网络的近似功能(同时考虑RELU和多线性激活功能)。我们的第一个结果是在两层神经网络完整的情况下,在基因化模型设置中进行计算和统计效率高的算法。我们的第二个结果考虑了这一设置,但只是在神经网功能等级的可变性之下。在这里,假设确定性动态,在代数层面的样本复杂度线性尺度线性(或精灵维度)方法所能达到的程度上,我们的结果大有改进。

0

相关内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！最新《应用机器学习》课程，讲述实战机器学习系统部署

不可错过！最新《应用机器学习》课程，讲述实战机器学习系统部署

专知会员服务

37+阅读 · 2020年12月1日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

专知会员服务

173+阅读 · 2020年11月13日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

已删除

将门创投

11+阅读 · 2019年8月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Backward diffusion-wave problem: stability, regularization and approximation

Arxiv

0+阅读 · 2021年9月15日

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Arxiv

0+阅读 · 2021年9月14日

PAC Learnability of Approximate Nash Equilibrium in Bimatrix Games

Arxiv

0+阅读 · 2021年9月14日

Optimal pointwise sampling for $L^2$ approximation

Arxiv

0+阅读 · 2021年9月13日

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

Arxiv

0+阅读 · 2021年9月10日

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Arxiv

0+阅读 · 2021年9月10日

Self-correcting Q-Learning

Arxiv

11+阅读 · 2020年12月2日

Approximation Ratios of Graph Neural Networks for Combinatorial Problems

Arxiv

7+阅读 · 2019年5月24日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

Efficient Road Lane Marking Detection with Deep Learning

Efficient Road Lane Marking Detection with Deep Learning

Arxiv

5+阅读 · 2018年9月11日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

不可错过！最新《应用机器学习》课程，讲述实战机器学习系统部署

不可错过！最新《应用机器学习》课程，讲述实战机器学习系统部署

专知会员服务

37+阅读 · 2020年12月1日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

专知会员服务

173+阅读 · 2020年11月13日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

已删除

将门创投

11+阅读 · 2019年8月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

RL 真经

CreateAMind

5+阅读 · 2018年12月28日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Backward diffusion-wave problem: stability, regularization and approximation

Arxiv

0+阅读 · 2021年9月15日

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Arxiv

0+阅读 · 2021年9月14日

PAC Learnability of Approximate Nash Equilibrium in Bimatrix Games

Arxiv

0+阅读 · 2021年9月14日

Optimal pointwise sampling for $L^2$ approximation

Arxiv

0+阅读 · 2021年9月13日

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

DeepOPF: A Feasibility-Optimized Deep Neural Network Approach for AC Optimal Power Flow Problems

Arxiv

0+阅读 · 2021年9月10日

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks

Arxiv

0+阅读 · 2021年9月10日

Self-correcting Q-Learning

Arxiv

11+阅读 · 2020年12月2日

Approximation Ratios of Graph Neural Networks for Combinatorial Problems

Arxiv

7+阅读 · 2019年5月24日

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation

Arxiv

8+阅读 · 2018年12月18日

Efficient Road Lane Marking Detection with Deep Learning

Efficient Road Lane Marking Detection with Deep Learning

Arxiv

5+阅读 · 2018年9月11日

微信扫码咨询专知VIP会员