深入了解强化学习的教程介绍 (A Tutorial Introduction to Reinforcement Learning) - 专知论文

会员服务 ·

0

强化学习 · 随机逼近 · 教程 · 马尔可夫决策过程 · 算法 ·

2023 年 4 月 3 日

A Tutorial Introduction to Reinforcement Learning

翻译：深入了解强化学习的教程介绍

Mathukumalli Vidyasagar

from arxiv, 32 pages, 3 figures

In this paper, we present a brief survey of Reinforcement Learning (RL), with particular emphasis on Stochastic Approximation (SA) as a unifying theme. The scope of the paper includes Markov Reward Processes, Markov Decision Processes, Stochastic Approximation algorithms, and widely used algorithms such as Temporal Difference Learning and $Q$-learning.

翻译：本文简要介绍了强化学习（RL）的调查，特别强调随机近似（SA）作为统一主题。文章的范围包括马尔可夫奖励过程、马尔可夫决策过程、随机逼近算法以及广泛使用的算法，如时间差异学习和 $Q$-learning。

1

相关内容

强化学习

强化学习（RL）是机器学习的一个领域，与软件代理应如何在环境中采取行动以最大化累积奖励的概念有关。除了监督学习和非监督学习外，强化学习是三种基本的机器学习范式之一。强化学习与监督学习的不同之处在于，不需要呈现带标签的输入/输出对，也不需要显式纠正次优动作。相反，重点是在探索（未知领域）和利用（当前知识）之间找到平衡。该环境通常以马尔可夫决策过程（MDP）的形式陈述，因为针对这种情况的许多强化学习算法都使用动态编程技术。经典动态规划方法和强化学习算法之间的主要区别在于，后者不假设MDP的确切数学模型，并且针对无法采用精确方法的大型MDP。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

最新《强化学习导论》教程，32页pdf

最新《强化学习导论》教程，32页pdf

专知会员服务

58+阅读 · 2023年4月5日

强化学习的简要总结，18页pdf

强化学习的简要总结，18页pdf

专知会员服务

58+阅读 · 2023年1月7日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

【COLT 2021- Tutorial】强化学习统计基础，140页ppt

专知会员服务

59+阅读 · 2021年8月8日

【限时开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【限时开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知会员服务

68+阅读 · 2020年7月15日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

专知会员服务

103+阅读 · 2020年1月3日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

最新《强化学习导论》教程，32页pdf

最新《强化学习导论》教程，32页pdf

专知

4+阅读 · 2023年4月5日

【2022新书】强化学习工业应用

【2022新书】强化学习工业应用

专知

18+阅读 · 2022年2月3日

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知

11+阅读 · 2020年7月15日

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

专知

16+阅读 · 2020年1月3日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

深度强化学习简介

深度强化学习简介

专知

30+阅读 · 2018年12月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

高超声速边界层中粗糙元强制转捩的机理

国家自然科学基金

0+阅读 · 2014年12月31日

西瓜低温诱导转录因子ClMYB的功能鉴定及其调控机制解析

国家自然科学基金

0+阅读 · 2014年12月31日

不确定环境下强化学习和决策的神经机制

国家自然科学基金

11+阅读 · 2012年12月31日

微通道气液界面波不稳定性及其对沸腾换热影响机理

国家自然科学基金

0+阅读 · 2012年12月31日

基于事件的强化学习及其在群机器人优化控制中的应用

国家自然科学基金

3+阅读 · 2012年12月31日

甲醇转化反应两种机理的对比研究

国家自然科学基金

0+阅读 · 2009年12月31日

大空间非平衡态等离子体燃烧点火及燃烧促进的研究

国家自然科学基金

0+阅读 · 2008年12月31日

TGF-β28608;活Myocardin家族诱导骨髓间充质干细胞分化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

11+阅读 · 2008年12月31日

GUARD: A Safe Reinforcement Learning Benchmark

Arxiv

0+阅读 · 2023年5月23日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

An Introduction to Autoencoders

Arxiv

17+阅读 · 2022年1月11日

Introduction to Online Convex Optimization

Arxiv

23+阅读 · 2021年12月19日

Recent Advances in Reinforcement Learning in Finance

Arxiv

11+阅读 · 2021年12月8日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Modern Introduction to Online Learning

A Modern Introduction to Online Learning

Arxiv

21+阅读 · 2019年12月31日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

VIP会员

文章信息

相关主题

马尔可夫决策过程

相关VIP内容

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

最新《强化学习导论》教程，32页pdf

最新《强化学习导论》教程，32页pdf

专知会员服务

58+阅读 · 2023年4月5日

强化学习的简要总结，18页pdf

强化学习的简要总结，18页pdf

专知会员服务

58+阅读 · 2023年1月7日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【2022新书】强化学习工业应用，408页pdf

【2022新书】强化学习工业应用，408页pdf

专知会员服务

231+阅读 · 2022年2月3日

【COLT 2021- Tutorial】强化学习统计基础，140页ppt

专知会员服务

59+阅读 · 2021年8月8日

【限时开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【限时开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知会员服务

68+阅读 · 2020年7月15日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

专知会员服务

103+阅读 · 2020年1月3日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

最新《强化学习导论》教程，32页pdf

最新《强化学习导论》教程，32页pdf

专知

4+阅读 · 2023年4月5日

【2022新书】强化学习工业应用

【2022新书】强化学习工业应用

专知

18+阅读 · 2022年2月3日

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知

11+阅读 · 2020年7月15日

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

55页图深度学习导论《A Gentle Introduction to Deep Learning for Graphs》

专知

16+阅读 · 2020年1月3日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

深度强化学习简介

深度强化学习简介

专知

30+阅读 · 2018年12月3日

Reinforcement Learning: An Introduction 2018第二版 500页

Reinforcement Learning: An Introduction 2018第二版 500页

CreateAMind

14+阅读 · 2018年4月27日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

GUARD: A Safe Reinforcement Learning Benchmark

Arxiv

0+阅读 · 2023年5月23日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Automated Reinforcement Learning (AutoRL): A Survey and Open Problems

Arxiv

33+阅读 · 2022年1月11日

An Introduction to Autoencoders

Arxiv

17+阅读 · 2022年1月11日

Introduction to Online Convex Optimization

Arxiv

23+阅读 · 2021年12月19日

Recent Advances in Reinforcement Learning in Finance

Arxiv

11+阅读 · 2021年12月8日

Transfer Learning in Deep Reinforcement Learning: A Survey

Transfer Learning in Deep Reinforcement Learning: A Survey

Arxiv

23+阅读 · 2020年9月16日

A Modern Introduction to Online Learning

A Modern Introduction to Online Learning

Arxiv

21+阅读 · 2019年12月31日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

相关基金

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

高超声速边界层中粗糙元强制转捩的机理

国家自然科学基金

0+阅读 · 2014年12月31日

西瓜低温诱导转录因子ClMYB的功能鉴定及其调控机制解析

国家自然科学基金

0+阅读 · 2014年12月31日

不确定环境下强化学习和决策的神经机制

国家自然科学基金

11+阅读 · 2012年12月31日

微通道气液界面波不稳定性及其对沸腾换热影响机理

国家自然科学基金

0+阅读 · 2012年12月31日

基于事件的强化学习及其在群机器人优化控制中的应用

国家自然科学基金

3+阅读 · 2012年12月31日

甲醇转化反应两种机理的对比研究

国家自然科学基金

0+阅读 · 2009年12月31日

大空间非平衡态等离子体燃烧点火及燃烧促进的研究

国家自然科学基金

0+阅读 · 2008年12月31日

TGF-β28608;活Myocardin家族诱导骨髓间充质干细胞分化的研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

11+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员