采用新的软计算方法,将专家知识纳入加强学习问题 (A new soft computing method for integration of expert's knowledge in reinforcement learn-ing problems)

This paper proposes a novel fuzzy action selection method to leverage human knowledge in reinforcement learning problems. Based on the estimates of the most current action-state values, the proposed fuzzy nonlinear mapping as-signs each member of the action set to its probability of being chosen in the next step. A user tunable parameter is introduced to control the action selection policy, which determines the agent's greedy behavior throughout the learning process. This parameter resembles the role of the temperature parameter in the softmax action selection policy, but its tuning process can be more knowledge-oriented since this parameter reflects the human knowledge into the learning agent by making modifications in the fuzzy rule base. Simulation results indicate that including fuzzy logic within the reinforcement learning in the proposed manner improves the learning algorithm's convergence rate, and provides superior performance.

翻译：本文提出了一个新的模糊行动选择方法, 以利用人类知识来强化学习问题。根据对当前行动状态值的估计, 拟议的模糊非线性绘图代表每个行动成员在下一个步骤中被选择的可能性。引入了一个用户可调试参数来控制行动选择政策, 以决定该代理人在整个学习过程中的贪婪行为。这个参数类似于温度参数在软体动作选择政策中的作用, 但其调控过程可以更加面向知识, 因为这个参数通过修改模糊规则基础, 将人类知识反映到学习媒介中。模拟结果显示, 以拟议的方式将模糊逻辑纳入强化学习中可以提高学习算法的趋同率, 并提供更优的性能。

相关内容

Soft Computing

关注 126

软计算（Soft Computing）致力于基于软计算技术的系统解决方案。它提供了软计算技术的重要成果的快速传播，融合了进化算法和遗传规划、神经科学和神经网络系统、模糊集理论和模糊系统、混沌理论和混沌系统的研究。软计算鼓励将软计算技术和工具集成到日常和高级应用程序中。通过将软计算的思想和技术与其他学科联系起来。因此，该杂志是一个所有科学家和工程师在这个快速增长的领域从事研究和发展的国际论坛。官网地址：http://dblp.uni-trier.de/db/journals/soco/

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【干货书】机器人元素Elements of Robotics ，311页pdf

专知会员服务

38+阅读 · 2021年4月16日

《行为与认知机器人学》，241页pdf

专知会员服务

54+阅读 · 2021年4月11日

【RLChina2020公开课】Lecture-11.pdf【多智能体学习与游戏AI前沿】

专知会员服务

27+阅读 · 2020年8月6日