孙希明

个人信息Personal Information

教授

博士生导师

硕士生导师

性别:男

毕业院校:东北大学

学位:博士

所在单位:控制科学与工程学院

学科:控制理论与控制工程

办公地点:创新园大厦A622房间

电子邮箱:sunxm@dlut.edu.cn

扫描关注

论文成果

当前位置: 中文主页 >> 科学研究 >> 论文成果

Finite Convergence of Value Iteration Algorithm for Discounted Innite Horizon Optimal Control of Stochastic Logical Systems

点击次数:

论文类型:会议论文

发表时间:2016-01-01

收录刊物:CPCI-S、SCIE

页面范围:216-222

关键字:Boolean control networks; Logical networks; Semi-tensor product; Optimal control; Stochastic logical dynamics

摘要:This paper investigates the discounted in nite horizon optimal control problem for the stochastic multi-valued logical dynamical systems with nite states. After giving the equivalent descriptions of the stochastic logical dynamical system in terms of Markov decision process, the in nite horizon optimization problem is presented in an algebraic form. Based on the semitensor product of matrices and the increasing-dimension technique, it is proved that the optimal stationary policy is obtained by a nite horizon value iteration process, and an exact horizon length estimation for the nite horizon approach is derived. As an application, the optimization problem of Human-machine game is investigated.