不同道路和天气条件下自主驾驶的强化课程学习调查价值 (Investigating Value of Curriculum Reinforcement Learning in Autonomous Driving Under Diverse Road and Weather Conditions) - 专知论文

会员服务 ·

0

Performer · Automator · tuning · 强化学习 · 学成 ·

2021 年 4 月 29 日

Investigating Value of Curriculum Reinforcement Learning in Autonomous Driving Under Diverse Road and Weather Conditions

翻译：不同道路和天气条件下自主驾驶的强化课程学习调查价值

Anil Ozturk,Mustafa Burak Gunel,Resul Dagdanov,Mirac Ekim Vural,Ferhat Yurdakul,Melih Dal,Nazim Kemal Ure

from arxiv, 8 pages, 9 figures, IV2021 Workshop submission

Applications of reinforcement learning (RL) are popular in autonomous driving tasks. That being said, tuning the performance of an RL agent and guaranteeing the generalization performance across variety of different driving scenarios is still largely an open problem. In particular, getting good performance on complex road and weather conditions require exhaustive tuning and computation time. Curriculum RL, which focuses on solving simpler automation tasks in order to transfer knowledge to complex tasks, is attracting attention in RL community. The main contribution of this paper is a systematic study for investigating the value of curriculum reinforcement learning in autonomous driving applications. For this purpose, we setup several different driving scenarios in a realistic driving simulator, with varying road complexity and weather conditions. Next, we train and evaluate performance of RL agents on different sequences of task combinations and curricula. Results show that curriculum RL can yield significant gains in complex driving tasks, both in terms of driving performance and sample complexity. Results also demonstrate that different curricula might enable different benefits, which hints future research directions for automated curriculum training.

翻译：强化学习(RL)的应用在自主驾驶任务中很受欢迎。也就是说,调整RL代理的性能和保证不同驾驶方案的一般性能在很大程度上仍然是一个尚未解决的问题。特别是,在复杂的道路和天气条件下取得良好的业绩需要详尽的调整和计算时间。课程RL侧重于解决简单的自动化任务,以便将知识转移给复杂的任务。本文的主要贡献是系统研究在自主驾驶应用程序中强化学习课程的价值。为此,我们在现实的驾驶模拟器中设置了几种不同的驾驶方案,其道路复杂程度和天气条件各不相同。接下来,我们培训和评价RL代理在不同任务组合和课程序列上的性能。结果显示,RL课程可以在复杂的驾驶任务中产生重大收益,无论是驾驶业绩还是抽样复杂性。结果还表明,不同的课程可能带来不同的好处,为自动化课程培训提供未来的研究方向。

0

相关内容

Performer

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

专知会员服务

126+阅读 · 2019年11月16日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

专知会员服务

208+阅读 · 2019年9月30日

Google Research Football (scenario 2) 实验

Google Research Football (scenario 2) 实验

CreateAMind

8+阅读 · 2019年8月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

机器人开发库软件大列表

机器人开发库软件大列表

专知

10+阅读 · 2018年3月18日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Radar Camera Fusion via Representation Learning in Autonomous Driving

Radar Camera Fusion via Representation Learning in Autonomous Driving

Arxiv

0+阅读 · 2021年6月18日

Dual Control for Exploitation and Exploration (DCEE) in Autonomous Search

Dual Control for Exploitation and Exploration (DCEE) in Autonomous Search

Arxiv

0+阅读 · 2021年6月17日

Design of a prototypical platform for autonomous and connected vehicles

Arxiv

0+阅读 · 2021年6月17日

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Arxiv

0+阅读 · 2021年6月16日

Plane and Sample: Maximizing Information about Autonomous Vehicle Performance using Submodular Optimization

Arxiv

0+阅读 · 2021年6月15日

A Multi-Layered Approach for Measuring the Simulation-to-Reality Gap of Radar Perception for Autonomous Driving

Arxiv

0+阅读 · 2021年6月15日

Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies

Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies

Arxiv

12+阅读 · 2020年6月10日

Risk-Aware Active Inverse Reinforcement Learning

Risk-Aware Active Inverse Reinforcement Learning

Arxiv

8+阅读 · 2019年1月8日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Hierarchical Reinforcement Learning with Deep Nested Agents

Arxiv

3+阅读 · 2018年5月18日

VIP会员

文章信息

相关主题

相关VIP内容

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

【微软Alekh等开放新书】强化学习理论与算法（Reinforcement Learning:Theory and Algorithms），附83页pdf

专知会员服务

121+阅读 · 2019年11月24日

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

专知会员服务

126+阅读 · 2019年11月16日

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

【DeepMind-Nando de Freitas】强化学习教程，102页ppt，Reinforcement Learning

专知会员服务

84+阅读 · 2019年11月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

GAN新书《生成式深度学习》，Generative Deep Learning，379页pdf

专知会员服务

208+阅读 · 2019年9月30日

热门VIP内容

开通专知VIP会员享更多权益服务

【NTU博士论文】利用强化学习与生成模型推进可靠且可泛化的决策

美海军研发“增强侦察与态势评估系统（ARES）”应用程序以优化作战规划（附研究论文）

【NeurIPS2025】DNA-DetectLLM：基于 DNA 启发的“突变-修复”范式揭示 AI 生成文本

面向深度研究系统的强化学习基础：综述

相关资讯

Google Research Football (scenario 2) 实验

Google Research Football (scenario 2) 实验

CreateAMind

8+阅读 · 2019年8月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

机器人开发库软件大列表

机器人开发库软件大列表

专知

10+阅读 · 2018年3月18日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Radar Camera Fusion via Representation Learning in Autonomous Driving

Radar Camera Fusion via Representation Learning in Autonomous Driving

Arxiv

0+阅读 · 2021年6月18日

Dual Control for Exploitation and Exploration (DCEE) in Autonomous Search

Dual Control for Exploitation and Exploration (DCEE) in Autonomous Search

Arxiv

0+阅读 · 2021年6月17日

Design of a prototypical platform for autonomous and connected vehicles

Arxiv

0+阅读 · 2021年6月17日

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Arxiv

0+阅读 · 2021年6月16日

Plane and Sample: Maximizing Information about Autonomous Vehicle Performance using Submodular Optimization

Arxiv

0+阅读 · 2021年6月15日

A Multi-Layered Approach for Measuring the Simulation-to-Reality Gap of Radar Perception for Autonomous Driving

Arxiv

0+阅读 · 2021年6月15日

Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies

Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies

Arxiv

12+阅读 · 2020年6月10日

Risk-Aware Active Inverse Reinforcement Learning

Risk-Aware Active Inverse Reinforcement Learning

Arxiv

8+阅读 · 2019年1月8日

A Tour of Reinforcement Learning: The View from Continuous Control

Arxiv

6+阅读 · 2018年6月25日

Hierarchical Reinforcement Learning with Deep Nested Agents

Arxiv

3+阅读 · 2018年5月18日

微信扫码咨询专知VIP会员