Infinite horizon average cost dynamic programming subject to ambiguity on conditional distribution
Date
2015ISBN
978-1-4799-7886-1Publisher
Institute of Electrical and Electronics Engineers Inc.Source
Proceedings of the IEEE Conference on Decision and ControlProceedings of the IEEE Conference on Decision and Control
Volume
54rd IEEE Conference on Decision and Control,CDC 2015Pages
7171-7176Google Scholar check
Keyword(s):
Metadata
Show full item recordAbstract
This paper addresses the optimality of stochastic control strategies based on the infinite horizon average cost criterion, subject to total variation distance ambiguity on the conditional distribution of the controlled process. This stochastic optimal control problem is formulated using minimax theory, in which the minimization is over the control strategies and the maximization is over the conditional distributions. Under the assumption that, for every stationary Markov control law the maximizing conditional distribution of the controlled process is irreducible, we derive a new dynamic programming recursion which minimizes the future ambiguity, and we propose a new policy iteration algorithm. The new dynamic programming recursion includes, in addition to the standard terms, the oscillator semi-norm of the cost-to-go. The maximizing conditional distribution is found via a water-filling algorithm. The implications of our results are demonstrated through an example. © 2015 IEEE.
Collections
Cite as
Related items
Showing items related by title, author, creator and subject.
-
Article
Classes of nonlinear partially observable stochastic optimal control problems with explicit optimal control laws
Charalambous, Charalambos D.; Elliott, R. J. (1998)This paper introduces certain nonlinear partially observable stochastic optimal control problems which are equivalent to completely observable control problems with finite-dimensional state space. In some cases the optimal ...
-
Article
Adaptive LQ control with anti-windup augmentation to optimize UAV performance in autonomous soaring applications
Kahveci, N. E.; Ioannou, Petros A.; Mirmirani, M. D. (2008)The performance of a glider can be considerably improved by means of soaring and optimization of its trajectory. Recent advances in the area of static soaring assume known linear glider dynamics and no actuator saturation ...
-
Article
Information Transfer of Control Strategies: Dualities of Stochastic Optimal Control Theory and Feedback Capacity of Information Theory
Charalambous, Charalambos D.; Kourtellaris, C. K.; Tzortzis, I. (2017)The control-coding capacity of stochastic control systems is introduced, and its operational meaning is established using randomized control strategies, which simultaneously control output processes encode information, and ...