[New Book Release] Original author Marc G. Bellemare releases a 315-page book on distributional reinforcement learning (Distributional RL) - 专知

January 11, 2022 · Deep Reinforcement Learning Lab (深度强化学习实验室)

Deep Reinforcement Learning Lab

Website: http://www.neurondance.com/

Forum: http://deeprl.neurondance.com/

Source: https://www.distributional-rl.org/

Typesetting: OpenDeepRL

This textbook aims to provide an introduction to the developing field of distributional reinforcement learning. The version provided below is a draft, currently under review at MIT Press.

The draft is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

We are grateful to all the people who helped make this book a reality – a full list will be provided in the final version of the book.

Distributional Reinforcement Learning

Table of Contents
1 Introduction
2 The Distribution of Returns
3 Learning the Return Distribution
4 Operators and Metrics
5 Distributional Dynamic Programming
6 Incremental Algorithms
7 Optimal Control
8 Statistical Functionals
9 Linear Function Approximation
10 Deep Reinforcement Learning
11 Looking Forward
Notation
Bibliography

FAQ and Caveat emptor

Can I get a PDF of this book? Why this format for the web version of the book?
Our agreement with the publisher allows us to make the draft available, but not as a PDF. This format gives access to the work to researchers who cannot readily purchase the published book.
When will the final version be available?
The book is still under submission and we are actively revising it based upon your feedback. It would be jinxing things to commit to a firm publication date.
Why are some pages strangely formatted?
We are aware of an excess of blank space on some pages – consider this part of enjoying reading a draft copy!
How do I provide feedback?
We welcome feedback and questions on all parts of the book (and in particular typographical issues and technical points). The preferred mode of communication is to email us at distributionalrl@gmail.com.

Citing the book

To cite this book, please use this bibtex entry:

@book{bdr2022,
    title={Distributional Reinforcement Learning},
    author={Marc G. Bellemare and Will Dabney and Mark Rowland},
    publisher={MIT Press},
    note={\url{http://www.distributional-rl.org}},
    year={2022}
}

Book table of contents: for the PDF download link, click "Read the original" (阅读原文) at the end of the article.
