Differential Assessment of Black-Box AI Agents

Rashmeet Kaur Nayyar; Pulkit Verma; Siddharth Srivastava

Differential Assessment of Black-Box AI Agents

Rashmeet Kaur Nayyar, Pulkit Verma, Siddharth Srivastava

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

Much of the research on learning symbolic models of AI agents focuses on agents with stationary models. This assumption fails to hold in settings where the agent's capabilities may change as a result of learning, adaptation, or other post-deployment modifications. Efficient assessment of agents in such settings is critical for learning the true capabilities of an AI system and for ensuring its safe usage. In this work, we propose a novel approach to differentially assess black-box AI agents that have drifted from their previously known models. As a starting point, we consider the fully observable and deterministic setting. We leverage sparse observations of the drifted agent's current behavior and knowledge of its initial model to generate an active querying policy that selectively queries the agent and computes an updated model of its functionality. Empirical evaluation shows that our approach is much more efficient than re-learning the agent model from scratch. We also show that the cost of differential assessment using our method is proportional to the amount of drift in the agent's functionality.

Original language	English (US)
Title of host publication	AAAI-22 Technical Tracks 9
Publisher	Association for the Advancement of Artificial Intelligence
Pages	9868-9876
Number of pages	9
ISBN (Electronic)	1577358767, 9781577358763
State	Published - Jun 30 2022
Event	36th AAAI Conference on Artificial Intelligence, AAAI 2022 - Virtual, Online Duration: Feb 22 2022 → Mar 1 2022

Publication series

Name	Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022
Volume	36

Conference

Conference	36th AAAI Conference on Artificial Intelligence, AAAI 2022
City	Virtual, Online
Period	2/22/22 → 3/1/22

ASJC Scopus subject areas

Artificial Intelligence

Cite this

Differential Assessment of Black-Box AI Agents. / Nayyar, Rashmeet Kaur; Verma, Pulkit; Srivastava, Siddharth.
AAAI-22 Technical Tracks 9. Association for the Advancement of Artificial Intelligence, 2022. p. 9868-9876 (Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022; Vol. 36).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Nayyar, RK, Verma, P & Srivastava, S 2022, Differential Assessment of Black-Box AI Agents. in AAAI-22 Technical Tracks 9. Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022, vol. 36, Association for the Advancement of Artificial Intelligence, pp. 9868-9876, 36th AAAI Conference on Artificial Intelligence, AAAI 2022, Virtual, Online, 2/22/22.

@inproceedings{b28fe20f9309486e97f55aebfc4f1883,

title = "Differential Assessment of Black-Box AI Agents",

abstract = "Much of the research on learning symbolic models of AI agents focuses on agents with stationary models. This assumption fails to hold in settings where the agent's capabilities may change as a result of learning, adaptation, or other post-deployment modifications. Efficient assessment of agents in such settings is critical for learning the true capabilities of an AI system and for ensuring its safe usage. In this work, we propose a novel approach to differentially assess black-box AI agents that have drifted from their previously known models. As a starting point, we consider the fully observable and deterministic setting. We leverage sparse observations of the drifted agent's current behavior and knowledge of its initial model to generate an active querying policy that selectively queries the agent and computes an updated model of its functionality. Empirical evaluation shows that our approach is much more efficient than re-learning the agent model from scratch. We also show that the cost of differential assessment using our method is proportional to the amount of drift in the agent's functionality.",

author = "Nayyar, {Rashmeet Kaur} and Pulkit Verma and Siddharth Srivastava",

note = "Publisher Copyright: Copyright {\textcopyright} 2022, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 36th AAAI Conference on Artificial Intelligence, AAAI 2022 ; Conference date: 22-02-2022 Through 01-03-2022",

year = "2022",

month = jun,

day = "30",

language = "English (US)",

series = "Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022",

publisher = "Association for the Advancement of Artificial Intelligence",

pages = "9868--9876",

booktitle = "AAAI-22 Technical Tracks 9",

}

TY - GEN

T1 - Differential Assessment of Black-Box AI Agents

AU - Nayyar, Rashmeet Kaur

AU - Verma, Pulkit

AU - Srivastava, Siddharth

PY - 2022/6/30

Y1 - 2022/6/30

N2 - Much of the research on learning symbolic models of AI agents focuses on agents with stationary models. This assumption fails to hold in settings where the agent's capabilities may change as a result of learning, adaptation, or other post-deployment modifications. Efficient assessment of agents in such settings is critical for learning the true capabilities of an AI system and for ensuring its safe usage. In this work, we propose a novel approach to differentially assess black-box AI agents that have drifted from their previously known models. As a starting point, we consider the fully observable and deterministic setting. We leverage sparse observations of the drifted agent's current behavior and knowledge of its initial model to generate an active querying policy that selectively queries the agent and computes an updated model of its functionality. Empirical evaluation shows that our approach is much more efficient than re-learning the agent model from scratch. We also show that the cost of differential assessment using our method is proportional to the amount of drift in the agent's functionality.

AB - Much of the research on learning symbolic models of AI agents focuses on agents with stationary models. This assumption fails to hold in settings where the agent's capabilities may change as a result of learning, adaptation, or other post-deployment modifications. Efficient assessment of agents in such settings is critical for learning the true capabilities of an AI system and for ensuring its safe usage. In this work, we propose a novel approach to differentially assess black-box AI agents that have drifted from their previously known models. As a starting point, we consider the fully observable and deterministic setting. We leverage sparse observations of the drifted agent's current behavior and knowledge of its initial model to generate an active querying policy that selectively queries the agent and computes an updated model of its functionality. Empirical evaluation shows that our approach is much more efficient than re-learning the agent model from scratch. We also show that the cost of differential assessment using our method is proportional to the amount of drift in the agent's functionality.

UR - http://www.scopus.com/inward/record.url?scp=85128351035&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85128351035&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85128351035

T3 - Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022

SP - 9868

EP - 9876

BT - AAAI-22 Technical Tracks 9

PB - Association for the Advancement of Artificial Intelligence

T2 - 36th AAAI Conference on Artificial Intelligence, AAAI 2022

Y2 - 22 February 2022 through 1 March 2022

ER -

Differential Assessment of Black-Box AI Agents

Abstract

Publication series

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this