TY - GEN
T1 - Distributed Q-learning with state tracking for multi-agent networked control
AU - Wang, Hang
AU - Lin, Sen
AU - Jafarkhani, Hamid
AU - Zhang, Junshan
N1 - Funding Information:
This work is supported in part by NSF Grants CNS-2003081 and CPS-1739344.
Publisher Copyright:
© 2021 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved.
PY - 2021
Y1 - 2021
AB - This paper studies distributed Q-learning for the Linear Quadratic Regulator (LQR) problem in a multi-agent network. Existing results often assume that agents can observe the global system state, which may be infeasible in large-scale systems due to privacy concerns or communication constraints. In this work, we consider a setting with unknown system models and no centralized coordinator. We devise a state tracking (ST) based Q-learning algorithm to design optimal controllers for the agents. Specifically, agents maintain local estimates of the global state based on their local information and communications with neighbors. At each step, every agent updates its local estimate of the global state, based on which it solves an approximate Q-factor locally through policy iteration. Assuming decaying injected excitation noise during policy evaluation, we prove that the local estimates converge to the true global state, and we establish the convergence of the proposed distributed ST-based Q-learning algorithm. Experimental studies corroborate our theoretical results by showing that the proposed method achieves performance comparable to the centralized case.
KW - Linear Quadratic Control
KW - Multi-agent
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85112253409&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112253409&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85112253409
T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
SP - 1680
EP - 1682
BT - 20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021
PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
T2 - 20th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2021
Y2 - 3 May 2021 through 7 May 2021
ER -