On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels

Upasana Biswas; Lin Guan; Subbarao Kambhampati

doi:10.1145/3610978.3640692

On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels

Upasana Biswas, Lin Guan, Subbarao Kambhampati

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

When engaging in collaborative tasks with unknown team members, humans demonstrate the ability to predict the behavior of their partners and adapt to it. Autonomous agents do not exhibit such adaptability, often struggling to integrate with new partners in multi-agent cooperative scenarios. Past work towards tackling this problem includes sampling from a population of diverse training partners. This consists of self-play agents at various skill levels, generated by checkpointing at various points throughout their training. In this work, we show that such a set of agents isn't representative of human skill levels by evaluating their qualitative and quantitative performance on the Overcooked Domain. Our results demonstrate that self-play agents exhibit distinct learning patterns in contrast to humans and a partially trained self-play agent demonstrates behaviors that diverges significantly from that of a lower-skilled human counterpart.

Original language	English (US)
Title of host publication	HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction
Publisher	IEEE Computer Society
Pages	252-256
Number of pages	5
ISBN (Electronic)	9798400703232
DOIs	https://doi.org/10.1145/3610978.3640692
State	Published - Mar 11 2024
Event	19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024 - Boulder, United States Duration: Mar 11 2024 → Mar 15 2024

Publication series

Name	ACM/IEEE International Conference on Human-Robot Interaction
ISSN (Electronic)	2167-2148

Conference

Conference	19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024
Country/Territory	United States
City	Boulder
Period	3/11/24 → 3/15/24

Keywords

Ad Hoc Teaming
Human Agent Collaboration
Mutual Adaptation
Zero-Shot Coordination

ASJC Scopus subject areas

Artificial Intelligence
Human-Computer Interaction
Electrical and Electronic Engineering

Access to Document

10.1145/3610978.3640692

Cite this

Biswas, U., Guan, L., & Kambhampati, S. (2024). On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. In HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction (pp. 252-256). (ACM/IEEE International Conference on Human-Robot Interaction). IEEE Computer Society. https://doi.org/10.1145/3610978.3640692

On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. / Biswas, Upasana; Guan, Lin; Kambhampati, Subbarao.
HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. IEEE Computer Society, 2024. p. 252-256 (ACM/IEEE International Conference on Human-Robot Interaction).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Biswas, U, Guan, L & Kambhampati, S 2024, On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. in HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. ACM/IEEE International Conference on Human-Robot Interaction, IEEE Computer Society, pp. 252-256, 19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024, Boulder, United States, 3/11/24. https://doi.org/10.1145/3610978.3640692

Biswas U, Guan L, Kambhampati S. On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. In HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. IEEE Computer Society. 2024. p. 252-256. (ACM/IEEE International Conference on Human-Robot Interaction). doi: 10.1145/3610978.3640692

Biswas, Upasana ; Guan, Lin ; Kambhampati, Subbarao. / On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels. HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. IEEE Computer Society, 2024. pp. 252-256 (ACM/IEEE International Conference on Human-Robot Interaction).

@inproceedings{89649ad75a2a4cda955daae308133249,

title = "On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels",

abstract = "When engaging in collaborative tasks with unknown team members, humans demonstrate the ability to predict the behavior of their partners and adapt to it. Autonomous agents do not exhibit such adaptability, often struggling to integrate with new partners in multi-agent cooperative scenarios. Past work towards tackling this problem includes sampling from a population of diverse training partners. This consists of self-play agents at various skill levels, generated by checkpointing at various points throughout their training. In this work, we show that such a set of agents isn't representative of human skill levels by evaluating their qualitative and quantitative performance on the Overcooked Domain. Our results demonstrate that self-play agents exhibit distinct learning patterns in contrast to humans and a partially trained self-play agent demonstrates behaviors that diverges significantly from that of a lower-skilled human counterpart.",

keywords = "Ad Hoc Teaming, Human Agent Collaboration, Mutual Adaptation, Zero-Shot Coordination",

author = "Upasana Biswas and Lin Guan and Subbarao Kambhampati",

note = "Publisher Copyright: {\textcopyright} 2024 Copyright held by the owner/author(s); 19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024 ; Conference date: 11-03-2024 Through 15-03-2024",

year = "2024",

month = mar,

day = "11",

doi = "10.1145/3610978.3640692",

language = "English (US)",

series = "ACM/IEEE International Conference on Human-Robot Interaction",

publisher = "IEEE Computer Society",

pages = "252--256",

booktitle = "HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction",

}

TY - GEN

T1 - On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels

AU - Biswas, Upasana

AU - Guan, Lin

AU - Kambhampati, Subbarao

PY - 2024/3/11

Y1 - 2024/3/11

N2 - When engaging in collaborative tasks with unknown team members, humans demonstrate the ability to predict the behavior of their partners and adapt to it. Autonomous agents do not exhibit such adaptability, often struggling to integrate with new partners in multi-agent cooperative scenarios. Past work towards tackling this problem includes sampling from a population of diverse training partners. This consists of self-play agents at various skill levels, generated by checkpointing at various points throughout their training. In this work, we show that such a set of agents isn't representative of human skill levels by evaluating their qualitative and quantitative performance on the Overcooked Domain. Our results demonstrate that self-play agents exhibit distinct learning patterns in contrast to humans and a partially trained self-play agent demonstrates behaviors that diverges significantly from that of a lower-skilled human counterpart.

AB - When engaging in collaborative tasks with unknown team members, humans demonstrate the ability to predict the behavior of their partners and adapt to it. Autonomous agents do not exhibit such adaptability, often struggling to integrate with new partners in multi-agent cooperative scenarios. Past work towards tackling this problem includes sampling from a population of diverse training partners. This consists of self-play agents at various skill levels, generated by checkpointing at various points throughout their training. In this work, we show that such a set of agents isn't representative of human skill levels by evaluating their qualitative and quantitative performance on the Overcooked Domain. Our results demonstrate that self-play agents exhibit distinct learning patterns in contrast to humans and a partially trained self-play agent demonstrates behaviors that diverges significantly from that of a lower-skilled human counterpart.

KW - Ad Hoc Teaming

KW - Human Agent Collaboration

KW - Mutual Adaptation

KW - Zero-Shot Coordination

UR - http://www.scopus.com/inward/record.url?scp=85188074066&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85188074066&partnerID=8YFLogxK

U2 - 10.1145/3610978.3640692

DO - 10.1145/3610978.3640692

M3 - Conference contribution

AN - SCOPUS:85188074066

T3 - ACM/IEEE International Conference on Human-Robot Interaction

SP - 252

EP - 256

BT - HRI 2024 Companion - Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction

PB - IEEE Computer Society

T2 - 19th Annual ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024

Y2 - 11 March 2024 through 15 March 2024

ER -

On the Pitfalls of Learning to Cooperate with Self Play Agents Checkpointed to Capture Humans of Diverse Skill Levels

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this