Acta mathematica scientia,Series A ›› 2000, Vol. 20 ›› Issue (1): 31-35.

• Articles • Previous Articles     Next Articles

The sample path optimality for nonstationary MDP with Average criterion

  

  1. department of mathematics,zhongshan university,guangzhou 510275
  • Online:2000-01-07 Published:2000-01-07

Abstract:

Inthispaper,weconsiderthesamplepathoptimalityfornonstationary MDPwitharbitrarystateandactionspacesunderaveragecriterion.Bythemartingaletheory,weprovetheexistenceofoptimalMarkovpoliciesundertheweakergodicconditions,andthenextendthemainresultsobtainedbyA.Aropostathis,V.Borkar,E.F.Gaucherand,M.GhoshandS.Marcus[1](1993)

Key words: Markovdecisionprogramming (MDP), Averagesamplepathcriterion, Nonstationary, OptimalMarkovpolicies

CLC Number: 

  • 90C40
Trendmd