Proximal algorithms and temporal difference methods for solving fixed point problems

Dimitri P. Bertsekas

doi:10.1007/s10589-018-9990-5

Proximal algorithms and temporal difference methods for solving fixed point problems

Dimitri P. Bertsekas

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD(λ), LSTD(λ), and LSPE(λ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.

Original language	English (US)
Pages (from-to)	709-736
Number of pages	28
Journal	Computational Optimization and Applications
Volume	70
Issue number	3
DOIs	https://doi.org/10.1007/s10589-018-9990-5
State	Published - Jul 1 2018
Externally published	Yes

Keywords

Convex optimization
Dynamic programming
Fixed point problems
Proximal algorithm
Temporal differences

ASJC Scopus subject areas

Control and Optimization
Computational Mathematics
Applied Mathematics

Access to Document

10.1007/s10589-018-9990-5

Cite this

@article{ecdc175de1154b399b61b601d73bff8f,

title = "Proximal algorithms and temporal difference methods for solving fixed point problems",

abstract = "In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD(λ), LSTD(λ), and LSPE(λ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.",

keywords = "Convex optimization, Dynamic programming, Fixed point problems, Proximal algorithm, Temporal differences",

author = "Bertsekas, {Dimitri P.}",

note = "Publisher Copyright: {\textcopyright} 2018, Springer Science+Business Media, LLC, part of Springer Nature.",

year = "2018",

month = jul,

day = "1",

doi = "10.1007/s10589-018-9990-5",

language = "English (US)",

volume = "70",

pages = "709--736",

journal = "Computational Optimization and Applications",

issn = "0926-6003",

publisher = "Springer Netherlands",

number = "3",

}

TY - JOUR

T1 - Proximal algorithms and temporal difference methods for solving fixed point problems

AU - Bertsekas, Dimitri P.

PY - 2018/7/1

Y1 - 2018/7/1

N2 - In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD(λ), LSTD(λ), and LSPE(λ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.

AB - In this paper we consider large fixed point problems and solution with proximal algorithms. We show that for linear problems there is a close connection between proximal iterations, which are prominent in numerical analysis and optimization, and multistep methods of the temporal difference type such as TD(λ), LSTD(λ), and LSPE(λ), which are central in simulation-based exact and approximate dynamic programming. One benefit of this connection is a new and simple way to accelerate the standard proximal algorithm by extrapolation towards a multistep iteration, which generically has a faster convergence rate. Another benefit is the potential for integration into the proximal algorithmic context of several new ideas that have emerged in the approximate dynamic programming context, including simulation-based implementations. Conversely, the analytical and algorithmic insights from proximal algorithms can be brought to bear on the analysis and the enhancement of temporal difference methods. We also generalize our linear case result to nonlinear problems that involve a contractive mapping, thus providing guaranteed and potentially substantial acceleration of the proximal and forward backward splitting algorithms at no extra cost. Moreover, under certain monotonicity assumptions, we extend the connection with temporal difference methods to nonlinear problems through a linearization approach.

KW - Convex optimization

KW - Dynamic programming

KW - Fixed point problems

KW - Proximal algorithm

KW - Temporal differences

UR - http://www.scopus.com/inward/record.url?scp=85045071819&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045071819&partnerID=8YFLogxK

U2 - 10.1007/s10589-018-9990-5

DO - 10.1007/s10589-018-9990-5

M3 - Article

AN - SCOPUS:85045071819

SN - 0926-6003

VL - 70

SP - 709

EP - 736

JO - Computational Optimization and Applications

JF - Computational Optimization and Applications

IS - 3

ER -

Proximal algorithms and temporal difference methods for solving fixed point problems

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this