Error bounds for approximations from projected linear equations

Huizhen Yu; Dimitri P. Bertsekas

doi:10.1287/moor.1100.0441

Error bounds for approximations from projected linear equations

Huizhen Yu, Dimitri P. Bertsekas

Research output: Contribution to journal › Article › peer-review

28 Scopus citations

Abstract

We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the ixed point mapping is a contraction, as is typically the case in Markov decision processes (MDP), one of our bounds is always sharper than the standard contraction-based bounds, and another one is often sharper. The former bound is also tight in a worst-case sense. Our bounds also apply to the noncontraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.

Original language	English (US)
Pages (from-to)	306-329
Number of pages	24
Journal	Mathematics of Operations Research
Volume	35
Issue number	2
DOIs	https://doi.org/10.1287/moor.1100.0441
State	Published - May 2010
Externally published	Yes

Keywords

Dynamic programming
Error bounds
Function approximation
Galerkin methods
Projected linear equations
Temporal difference methods

ASJC Scopus subject areas

General Mathematics
Computer Science Applications
Management Science and Operations Research

Access to Document

10.1287/moor.1100.0441

Cite this

@article{868d9dd605df49a19181fa991ffdfc1a,

title = "Error bounds for approximations from projected linear equations",

abstract = "We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the ixed point mapping is a contraction, as is typically the case in Markov decision processes (MDP), one of our bounds is always sharper than the standard contraction-based bounds, and another one is often sharper. The former bound is also tight in a worst-case sense. Our bounds also apply to the noncontraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.",

keywords = "Dynamic programming, Error bounds, Function approximation, Galerkin methods, Projected linear equations, Temporal difference methods",

author = "Huizhen Yu and Bertsekas, {Dimitri P.}",

year = "2010",

month = may,

doi = "10.1287/moor.1100.0441",

language = "English (US)",

volume = "35",

pages = "306--329",

journal = "Mathematics of Operations Research",

issn = "0364-765X",

publisher = "INFORMS Inst.for Operations Res.and the Management Sciences",

number = "2",

}

TY - JOUR

T1 - Error bounds for approximations from projected linear equations

AU - Yu, Huizhen

AU - Bertsekas, Dimitri P.

PY - 2010/5

Y1 - 2010/5

N2 - We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the ixed point mapping is a contraction, as is typically the case in Markov decision processes (MDP), one of our bounds is always sharper than the standard contraction-based bounds, and another one is often sharper. The former bound is also tight in a worst-case sense. Our bounds also apply to the noncontraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.

AB - We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the ixed point mapping is a contraction, as is typically the case in Markov decision processes (MDP), one of our bounds is always sharper than the standard contraction-based bounds, and another one is often sharper. The former bound is also tight in a worst-case sense. Our bounds also apply to the noncontraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.

KW - Dynamic programming

KW - Error bounds

KW - Function approximation

KW - Galerkin methods

KW - Projected linear equations

KW - Temporal difference methods

UR - http://www.scopus.com/inward/record.url?scp=77953119098&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953119098&partnerID=8YFLogxK

U2 - 10.1287/moor.1100.0441

DO - 10.1287/moor.1100.0441

M3 - Article

AN - SCOPUS:77953119098

SN - 0364-765X

VL - 35

SP - 306

EP - 329

JO - Mathematics of Operations Research

JF - Mathematics of Operations Research

IS - 2

ER -

Error bounds for approximations from projected linear equations

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this