TY - GEN
T1 - Error bound analysis of policy iteration based approximate dynamic programming for deterministic discrete-time nonlinear systems
AU - Guo, Wentao
AU - Liu, Feng
AU - Si, Jennie
AU - Mei, Shengwei
AU - Li, Rui
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/9/28
Y1 - 2015/9/28
N2 - Numerous approximate dynamic programming (ADP) algorithms have been developed based on policy iteration. For policy iteration based ADP of deterministic discrete-time nonlinear systems, the existing literature has proved convergence in the undiscounted value function formulation under the assumption of exact approximation. The error bound of policy iteration based ADP has also been analyzed in a discounted value function formulation that accounts for approximation errors. However, no error bound analysis exists for policy iteration based ADP in the undiscounted value function formulation with approximation errors taken into account. In this paper, we fill this theoretical gap. We provide a sufficient condition on the approximation error so that the iterative value function remains bounded in a neighbourhood of the optimal value function. To the best of the authors' knowledge, this is the first error bound result for undiscounted policy iteration for deterministic discrete-time nonlinear systems that considers approximation errors.
AB - Numerous approximate dynamic programming (ADP) algorithms have been developed based on policy iteration. For policy iteration based ADP of deterministic discrete-time nonlinear systems, the existing literature has proved convergence in the undiscounted value function formulation under the assumption of exact approximation. The error bound of policy iteration based ADP has also been analyzed in a discounted value function formulation that accounts for approximation errors. However, no error bound analysis exists for policy iteration based ADP in the undiscounted value function formulation with approximation errors taken into account. In this paper, we fill this theoretical gap. We provide a sufficient condition on the approximation error so that the iterative value function remains bounded in a neighbourhood of the optimal value function. To the best of the authors' knowledge, this is the first error bound result for undiscounted policy iteration for deterministic discrete-time nonlinear systems that considers approximation errors.
KW - Approximation algorithms
KW - Approximation methods
KW - Mathematical model
UR - http://www.scopus.com/inward/record.url?scp=84951023469&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84951023469&partnerID=8YFLogxK
U2 - 10.1109/IJCNN.2015.7280783
DO - 10.1109/IJCNN.2015.7280783
M3 - Conference contribution
AN - SCOPUS:84951023469
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2015 International Joint Conference on Neural Networks, IJCNN 2015
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - International Joint Conference on Neural Networks, IJCNN 2015
Y2 - 12 July 2015 through 17 July 2015
ER -