Policy synthesis for factored MDPs with graph temporal logic specifications

Murat Cubuktepe; Zhe Xu; Ufuk Topcu

Policy synthesis for factored MDPs with graph temporal logic specifications

Murat Cubuktepe, Zhe Xu, Ufuk Topcu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies on disease control and urban security. The numerical examples show that the algorithm can scale to hundreds of agents.

Original language	English (US)
Title of host publication	Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
Editors	Bo An, Amal El Fallah Seghrouchni, Gita Sukthankar
Publisher	International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Pages	267-275
Number of pages	9
ISBN (Electronic)	9781450375184
State	Published - 2020
Externally published	Yes
Event	19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 - Virtual, Auckland, New Zealand Duration: May 19 2020 → …

Publication series

Name	Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
Volume	2020-May
ISSN (Print)	1548-8403
ISSN (Electronic)	1558-2914

Conference

Conference	19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020
Country/Territory	New Zealand
City	Virtual, Auckland
Period	5/19/20 → …

ASJC Scopus subject areas

Artificial Intelligence
Software
Control and Systems Engineering

Cite this

Cubuktepe, M., Xu, Z., & Topcu, U. (2020). Policy synthesis for factored MDPs with graph temporal logic specifications. In B. An, A. El Fallah Seghrouchni, & G. Sukthankar (Eds.), Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 (pp. 267-275). (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS; Vol. 2020-May). International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS).

Policy synthesis for factored MDPs with graph temporal logic specifications. / Cubuktepe, Murat; Xu, Zhe; Topcu, Ufuk.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. ed. / Bo An; Amal El Fallah Seghrouchni; Gita Sukthankar. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2020. p. 267-275 (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS; Vol. 2020-May).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Cubuktepe, M, Xu, Z & Topcu, U 2020, Policy synthesis for factored MDPs with graph temporal logic specifications. in B An, A El Fallah Seghrouchni & G Sukthankar (eds), Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, vol. 2020-May, International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), pp. 267-275, 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020, Virtual, Auckland, New Zealand, 5/19/20.

Cubuktepe M, Xu Z, Topcu U. Policy synthesis for factored MDPs with graph temporal logic specifications. In An B, El Fallah Seghrouchni A, Sukthankar G, editors, Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS). 2020. p. 267-275. (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS).

Cubuktepe, Murat ; Xu, Zhe ; Topcu, Ufuk. / Policy synthesis for factored MDPs with graph temporal logic specifications. Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020. editor / Bo An ; Amal El Fallah Seghrouchni ; Gita Sukthankar. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), 2020. pp. 267-275 (Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS).

@inproceedings{4cca3173de0e441bb7bf1430ecc21610,

title = "Policy synthesis for factored MDPs with graph temporal logic specifications",

abstract = "We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies on disease control and urban security. The numerical examples show that the algorithm can scale to hundreds of agents.",

author = "Murat Cubuktepe and Zhe Xu and Ufuk Topcu",

note = "Funding Information: Partially funded by the grants AFRL FA9550-19-1-0169, and ONR N00014-18-1-2829. Proc. of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2020), B. An, N. Yorke-Smith, A. El Fallah Seghrouchni, G. Sukthankar (eds.), May 9–13, 2020, Auckland, New Zealand. {\textcopyright} 2020 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved. Publisher Copyright: {\textcopyright} 2020 International Foundation for Autonomous.; 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020 ; Conference date: 19-05-2020",

year = "2020",

language = "English (US)",

series = "Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS",

publisher = "International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)",

pages = "267--275",

editor = "Bo An and {El Fallah Seghrouchni}, Amal and Gita Sukthankar",

booktitle = "Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020",

}

TY - GEN

T1 - Policy synthesis for factored MDPs with graph temporal logic specifications

AU - Cubuktepe, Murat

AU - Xu, Zhe

AU - Topcu, Ufuk

N1 - Funding Information: Partially funded by the grants AFRL FA9550-19-1-0169, and ONR N00014-18-1-2829. Proc. of the 19th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2020), B. An, N. Yorke-Smith, A. El Fallah Seghrouchni, G. Sukthankar (eds.), May 9–13, 2020, Auckland, New Zealand. © 2020 International Foundation for Autonomous Agents and Multiagent Systems (www.ifaamas.org). All rights reserved. Publisher Copyright: © 2020 International Foundation for Autonomous.

PY - 2020

Y1 - 2020

N2 - We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies on disease control and urban security. The numerical examples show that the algorithm can scale to hundreds of agents.

AB - We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies on disease control and urban security. The numerical examples show that the algorithm can scale to hundreds of agents.

UR - http://www.scopus.com/inward/record.url?scp=85089573987&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85089573987&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85089573987

T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS

SP - 267

EP - 275

BT - Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020

A2 - An, Bo

A2 - El Fallah Seghrouchni, Amal

A2 - Sukthankar, Gita

PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)

T2 - 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2020

Y2 - 19 May 2020

ER -

Policy synthesis for factored MDPs with graph temporal logic specifications

Abstract

Publication series

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this