TY - GEN
T1 - New error bounds for approximations from projected linear equations
AU - Yu, Huizhen
AU - Bertsekas, Dimitri P.
N1 - Copyright:
Copyright 2009 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the fixed point mapping is a contraction, as is typically the case in Markovian decision processes (MDP), one of our bounds is always sharper than the standard worst case bounds, and another one is often sharper. Our bounds also apply to the non-contraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.
AB - We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the fixed point mapping is a contraction, as is typically the case in Markovian decision processes (MDP), one of our bounds is always sharper than the standard worst case bounds, and another one is often sharper. Our bounds also apply to the non-contraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.
UR - http://www.scopus.com/inward/record.url?scp=58449106856&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=58449106856&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-89722-4_20
DO - 10.1007/978-3-540-89722-4_20
M3 - Conference contribution
AN - SCOPUS:58449106856
SN - 3540897216
SN - 9783540897217
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 253
EP - 267
BT - Recent Advances in Reinforcement Learning - 8th European Workshop, EWRL 2008, Revised and Selected Papers
T2 - 8th European Workshop on Reinforcement Learning, EWRL 2008
Y2 - 30 June 2008 through 3 July 2008
ER -