深强化学习与进进战略:比较调查 (Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey)

Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison is provided on key aspects such as scalability, exploration, adaptation to dynamic environments, and multi-agent learning. Then, the benefits of hybrid algorithms that combine concepts from DRL and ESs are highlighted. Finally, to have an indication about how they compare in real-world applications, a survey of the literature for the set of applications they support is provided.

翻译：深入强化学习(DRL)和进化战略(ES)在许多相继决策问题上超越了人的水平控制,但仍然存在许多公开的挑战。为了深入了解DRL相对于ES的长处和短处,提供了对其各自能力和局限性的分析。在介绍其基本概念和算法之后,对可扩展性、探索、适应动态环境和多试剂学习等关键方面进行了比较。然后,强调了将DRL和ES的概念结合起来的混合算法的好处。最后,为了说明它们如何在现实世界应用中进行比较,提供了对其所支持的一系列应用的文献调查。

相关内容

深度强化学习

关注 156

深度强化学习 (DRL) 是一种使用深度学习技术扩展传统强化学习方法的一种机器学习方法。传统强化学习方法的主要任务是使得主体根据从环境中获得的奖赏能够学习到最大化奖赏的行为。然而，传统无模型强化学习方法需要使用函数逼近技术使得主体能够学习出值函数或者策略。在这种情况下，深度学习强大的函数逼近能力自然成为了替代人工指定特征的最好手段并为性能更好的端到端学习的实现提供了可能。

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

专知会员服务

142+阅读 · 2020年4月30日

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日