Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Dimitri P. Bertsekas

doi:10.1109/TAC.2019.2896049

Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Dimitri P. Bertsekas

Research output: Contribution to journal › Article › peer-review

16 Scopus citations

Abstract

In this paper, we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases several classical models, such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties, and we emphasize shortest path-type models. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, whereas for others it is not. In one line of analysis, we impose assumptions guaranteeing that the noncontractive policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of solution of Bellman's equation, and the validity of forms of value and policy iteration. In the absence of these assumptions, the results are weaker and unusual in character: the optimal cost function need not be a solution of Bellman's equation, and may not be found by value or policy iteration. Instead the optimal cost function over just the contractive policies is the largest solution of Bellman's equation, and can be computed by a variety of algorithms.

Original language	English (US)
Article number	8629039
Pages (from-to)	3117-3128
Number of pages	12
Journal	IEEE Transactions on Automatic Control
Volume	64
Issue number	8
DOIs	https://doi.org/10.1109/TAC.2019.2896049
State	Published - Aug 2019
Externally published	Yes

Keywords

Dynamic programming (DP)
Markov decision processes
risk sensitive control
stochastic shortest paths

ASJC Scopus subject areas

Control and Systems Engineering
Computer Science Applications
Electrical and Electronic Engineering

Access to Document

10.1109/TAC.2019.2896049

Cite this

@article{75af6af0cc6d4e9788cdedcd2e43ab34,

title = "Affine Monotonic and Risk-Sensitive Models in Dynamic Programming",

abstract = "In this paper, we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases several classical models, such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties, and we emphasize shortest path-type models. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, whereas for others it is not. In one line of analysis, we impose assumptions guaranteeing that the noncontractive policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of solution of Bellman's equation, and the validity of forms of value and policy iteration. In the absence of these assumptions, the results are weaker and unusual in character: the optimal cost function need not be a solution of Bellman's equation, and may not be found by value or policy iteration. Instead the optimal cost function over just the contractive policies is the largest solution of Bellman's equation, and can be computed by a variety of algorithms.",

keywords = "Dynamic programming (DP), Markov decision processes, risk sensitive control, stochastic shortest paths",

author = "Bertsekas, {Dimitri P.}",

note = "Publisher Copyright: {\textcopyright} 1963-2012 IEEE.",

year = "2019",

month = aug,

doi = "10.1109/TAC.2019.2896049",

language = "English (US)",

volume = "64",

pages = "3117--3128",

journal = "IEEE Transactions on Automatic Control",

issn = "0018-9286",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "8",

}

TY - JOUR

T1 - Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

AU - Bertsekas, Dimitri P.

PY - 2019/8

Y1 - 2019/8

N2 - In this paper, we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases several classical models, such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties, and we emphasize shortest path-type models. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, whereas for others it is not. In one line of analysis, we impose assumptions guaranteeing that the noncontractive policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of solution of Bellman's equation, and the validity of forms of value and policy iteration. In the absence of these assumptions, the results are weaker and unusual in character: the optimal cost function need not be a solution of Bellman's equation, and may not be found by value or policy iteration. Instead the optimal cost function over just the contractive policies is the largest solution of Bellman's equation, and can be computed by a variety of algorithms.

AB - In this paper, we consider a broad class of infinite horizon discrete-time optimal control models that involve a nonnegative cost function and an affine mapping in their dynamic programming equation. They include as special cases several classical models, such as stochastic undiscounted nonnegative cost problems, stochastic multiplicative cost problems, and risk-sensitive problems with exponential cost. We focus on the case where the state space is finite and the control space has some compactness properties, and we emphasize shortest path-type models. We assume that the affine mapping has a semicontractive character, whereby for some policies it is a contraction, whereas for others it is not. In one line of analysis, we impose assumptions guaranteeing that the noncontractive policies cannot be optimal. Under these assumptions, we prove strong results that resemble those for discounted Markovian decision problems, such as the uniqueness of solution of Bellman's equation, and the validity of forms of value and policy iteration. In the absence of these assumptions, the results are weaker and unusual in character: the optimal cost function need not be a solution of Bellman's equation, and may not be found by value or policy iteration. Instead the optimal cost function over just the contractive policies is the largest solution of Bellman's equation, and can be computed by a variety of algorithms.

KW - Dynamic programming (DP)

KW - Markov decision processes

KW - risk sensitive control

KW - stochastic shortest paths

UR - http://www.scopus.com/inward/record.url?scp=85070995064&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070995064&partnerID=8YFLogxK

U2 - 10.1109/TAC.2019.2896049

DO - 10.1109/TAC.2019.2896049

M3 - Article

AN - SCOPUS:85070995064

SN - 0018-9286

VL - 64

SP - 3117

EP - 3128

JO - IEEE Transactions on Automatic Control

JF - IEEE Transactions on Automatic Control

IS - 8

M1 - 8629039

ER -

Affine Monotonic and Risk-Sensitive Models in Dynamic Programming

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this