数学物理学报 ›› 1997, Vol. 17 ›› Issue (4): 432-438.

• 论文 • 上一篇    下一篇

非齐次马氏决策过程的齐次化

侯振挺, 郭先平   

  1. 长沙铁道学院科研所 长沙 410075
  • 收稿日期:1996-04-22 出版日期:1997-08-26 发布日期:1997-08-26
  • 基金资助:
    国家自然科学基金

The Homogenization of Non-homogeneous Markov Decision Processes

Hou Zhenting, Guo Xianping   

  1. Changsha Railway University 410075
  • Received:1996-04-22 Online:1997-08-26 Published:1997-08-26

摘要: 该文考虑的是可数状态空间有限行动空间非齐次马氏决策过程的期望总报酬准则.与以往不同的是,我们是通过扩大状态空间的方法,将非齐次的马氏决策过程转化成齐次的马氏决策过程,于是非常简洁地得到了按传统的方法所得的主要结果.

关键词: 马氏决策过程, 非齐次, 齐次, 期望总报酬准则, 最优策略

Abstract: In this paper we consider the homogenization of Non-homogneous Markov decision model with expected total reward criterion.denumerable state space and finite action spaces. We translate the non-homogeneous Markov decision processes into a homegeneous one by the method of extanding state space which is different from the usual one, and then we easily obtain the mail results obtained by usual method. Specially, the mail results obtained by K. Hinderer.

Key words: Markov decision processes, Non-homogeneous, Homogenization, Optimal policies