![]() |
个人信息Personal Information
副教授
硕士生导师
性别:男
毕业院校:大连理工大学
学位:博士
所在单位:计算机科学与技术学院
学科:计算机应用技术
办公地点:创新园大厦A826
联系方式:rjk@dlut.edu.cn
电子邮箱:rjk@dlut.edu.cn
Adaptively Shaping Reinforcement Learning Agents via Human Reward
点击次数:
论文类型:会议论文
发表时间:2018-01-01
收录刊物:CPCI-S
卷号:11012
页面范围:85-97
关键字:Reinforcement learning; Human reward; Reward Shaping; Human-agent interaction
摘要:The computational complexity of reinforcement learning algorithms increases exponentially with the size of the problem. An effective solution to this problem is to provide reinforcement learning agents with informationally rich human knowledge, so as to expedite the learning process. Various integration methods have been proposed to combine human reward with agent reward in reinforcement learning. However, the essential distinction of these combination methods and their respective advantages and disadvantages are still unclear. In this paper, we propose an adaptive learning algorithm that is capable of selecting the most suitable method from a portfolio of combination methods in an adaptive manner. We show empirically that our algorithm enables better learning performance under various conditions, compared to the approaches using one combination method alone. By analyzing different ways of integrating human knowledge into reinforcement learning, our work provides some important insights into understanding the role and impact of human factors in human-robot collaborative learning.