Show simple item record

dc.contributor.authorTzortzis, Ioannisen
dc.contributor.authorCharalambous, Charalambos D.en
dc.contributor.authorCharalambous, Themistoklisen
dc.creatorTzortzis, Ioannisen
dc.creatorCharalambous, Charalambos D.en
dc.creatorCharalambous, Themistoklisen
dc.date.accessioned2021-01-26T09:45:29Z
dc.date.available2021-01-26T09:45:29Z
dc.date.issued2019
dc.identifier.issn0363-0129
dc.identifier.urihttp://gnosis.library.ucy.ac.cy/handle/7/63254
dc.description.abstractWe analyze the per unit-time infinite horizon average cost Markov control model, subject to a total variation distance ambiguity on the controlled process conditional distribution. This stochastic optimal control problem is formulated as a minimax optimization problem in which the minimization is over the admissible set of control strategies, while the maximization is over the set of conditional distributions which are in a ball, with respect to the total variation distance, centered at a nominal distribution. We derive two new equivalent dynamic programming equations, and a new policy iteration algorithm. The main feature of the new dynamic programming equations is that the optimal control strategies are insensitive to inaccuracies or ambiguities in the controlled process conditional distribution. The main feature of the new policy iteration algorithm is that the policy evaluation and policy improvement steps are performed using the maximizing conditional distribution, which is obtained via a water filling solution of aggregating states together to form new states. Throughout the paper, we illustrate the new dynamic programming equations and the corresponding policy iteration algorithm to various examples.en
dc.sourceSIAM Journal on Control and Optimizationen
dc.source.urihttps://epubs.siam.org/doi/10.1137/18M1210514
dc.titleInfinite Horizon Average Cost Dynamic Programming Subject to Total Variation Distance Ambiguityen
dc.typeinfo:eu-repo/semantics/article
dc.identifier.doi10.1137/18M1210514
dc.description.volume57
dc.description.issue4
dc.description.startingpage2843
dc.description.endingpage2872
dc.author.facultyΠολυτεχνική Σχολή / Faculty of Engineering
dc.author.departmentΤμήμα Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών / Department of Electrical and Computer Engineering
dc.type.uhtypeArticleen
dc.source.abbreviationSIAM J. Control Optim.en
dc.contributor.orcidCharalambous, Charalambos D. [0000-0002-2168-0231]
dc.contributor.orcidCharalambous, Themistoklis [0000-0003-4800-6738]
dc.gnosis.orcid0000-0002-2168-0231
dc.gnosis.orcid0000-0003-4800-6738


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record