FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning Techniques

Reinforcement learning is one of the most popular approaches to automated game playing. It allows an agent to estimate the expected utility of its state in order to choose optimal actions in an unknown environment. We apply reinforcement learning algorithms to the game Flappy Bird. We implement SARSA and Q-Learning with several modifications, including an $\epsilon$-greedy policy, discretization, and backward updates. Both SARSA and Q-Learning outperform the baseline, regularly achieving scores of 1400+, with a highest in-game score of 2069.
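The abstract names the techniques but gives no implementation details, so the following is only a minimal tabular sketch of how they usually fit together. The state features (`dx`, `dy`), the grid size, the action encoding, and the values of `alpha` and `gamma` are all illustrative assumptions, not the settings used in the paper.

```python
import random
from collections import defaultdict

ACTIONS = (0, 1)  # assumed encoding: 0 = do nothing, 1 = flap


def discretize(dx, dy, grid=10):
    """Bucket continuous offsets to the next pipe into coarse grid cells.

    dx, dy and the grid size are hypothetical; the abstract only says
    that discretization is used.
    """
    return (int(dx // grid), int(dy // grid))


def epsilon_greedy(q, state, epsilon, rng=random):
    """Explore with probability epsilon, otherwise pick the greedy action."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action in the next state."""
    best_next = max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually taken next."""
    q[(s, a)] += alpha * (r + gamma * q[(s_next, a_next)] - q[(s, a)])


def backward_q_update(q, episode, alpha=0.1, gamma=0.9):
    """Replay one episode's (s, a, r, s_next) transitions last to first,

    so a reward at the end of the episode reaches earlier states in a
    single pass. This is one common reading of "backward updates".
    """
    for s, a, r, s_next in reversed(episode):
        q_learning_update(q, s, a, r, s_next, alpha, gamma)
```

A Q-table here is simply `defaultdict(float)`, so unseen state-action pairs start at zero. The only difference between the two update rules is the bootstrap term: Q-Learning uses the maximum over next actions, while SARSA uses the value of the next action chosen by the $\epsilon$-greedy policy.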


Related content

Flappy Bird

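Flappy Bird (known in Chinese as 飞扬的小鸟, 像素鸟, 下坠的小鸟, or 笨鸟) is a game developed by Dong Nguyen, an independent game developer from Vietnam. It was released on May 24, 2013 and abruptly went viral in February 2014.
In February 2014, the developer himself pulled Flappy Bird from the Apple and Google app stores. In August 2014 it officially returned to the App Store, adding the multiplayer mode that Flappy fans had long awaited. In the game, the player controls a bird and must steer it past obstacles made of pipes of varying lengths.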
