任健康

个人信息Personal Information

副教授

博士生导师

硕士生导师

性别：男

毕业院校：大连理工大学

学位：博士

所在单位：计算机科学与技术学院

学科：计算机应用技术

办公地点：创新园大厦A826

联系方式：

电子邮箱：

扫描关注

同专业博导同专业硕导个人学术主页

论文成果

当前位置：中文主页 >> 科学研究 >> 论文成果

Adaptively Shaping Reinforcement Learning Agents via Human Reward

点击次数：

论文类型：会议论文

发表时间：2018-01-01

收录刊物：CPCI-S

卷号：11012

页面范围：85-97

关键字：Reinforcement learning; Human reward; Reward Shaping; Human-agent interaction

摘要：The computational complexity of reinforcement learning algorithms increases exponentially with the size of the problem. An effective solution to this problem is to provide reinforcement learning agents with informationally rich human knowledge, so as to expedite the learning process. Various integration methods have been proposed to combine human reward with agent reward in reinforcement learning. However, the essential distinction of these combination methods and their respective advantages and disadvantages are still unclear. In this paper, we propose an adaptive learning algorithm that is capable of selecting the most suitable method from a portfolio of combination methods in an adaptive manner. We show empirically that our algorithm enables better learning performance under various conditions, compared to the approaches using one combination method alone. By analyzing different ways of integrating human knowledge into reinforcement learning, our work provides some important insights into understanding the role and impact of human factors in human-robot collaborative learning.

上一条：Cooperative Deep Q-Learning With Q-Value Transfer for Multi-Intersection Signal Control

下一条：mTS: Temporal- and Spatial-Collaborative Charging for Wireless Rechargeable Sensor Networks with Multiple Vehicles