Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents

Yael Septon; Tobias Huber; Elisabeth André; Ofra Amir

doi:10.1007/978-3-031-37616-0_27

Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents

Yael Septon, Tobias Huber, Elisabeth André, Ofra Amir

Data and Decision Sciences

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Explainable reinforcement learning methods can roughly be divided into local explanations that analyze specific decisions of the agents and global explanations that convey the general strategy of the agents. In this work, we study a novel combination of local and global explanations for reinforcement learning agents. Specifically, we combine reward decomposition, a local explanation method that exposes which components of the reward function influenced a specific decision, and HIGHLIGHTS, a global explanation method that shows a summary of the agent’s behavior in decisive states. Results from two user studies show significant benefits for both methods. We found that the local reward decomposition was more useful for identifying the agents’ priorities. However, when there was only a minor difference between the agents’ preferences, the global information provided by HIGHLIGHTS additionally improved participants’ understanding.

Original language	English
Title of host publication	Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings
Editors	Philippe Mathieu, Frank Dignum, Paulo Novais, Fernando De la Prieta
Pages	320-332
Number of pages	13
DOIs	https://doi.org/10.1007/978-3-031-37616-0_27
State	Published - 2023
Event	21st International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2023 - Guimaraes, Portugal Duration: 12 Jul 2023 → 14 Jul 2023

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	13955 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	21st International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2023
Country/Territory	Portugal
City	Guimaraes
Period	12/07/23 → 14/07/23

Keywords

Explainable AI
Neural Networks
Reinforcement Learning

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-031-37616-0_27

Cite this

Septon, Y., Huber, T., André, E., & Amir, O. (2023). Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents. In P. Mathieu, F. Dignum, P. Novais, & F. De la Prieta (Eds.), Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings (pp. 320-332). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13955 LNAI). https://doi.org/10.1007/978-3-031-37616-0_27

Septon, Yael ; Huber, Tobias ; André, Elisabeth et al. / Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents. Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings. editor / Philippe Mathieu ; Frank Dignum ; Paulo Novais ; Fernando De la Prieta. 2023. pp. 320-332 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{43419a4567d74680a4ce4a8932eabdf5,

title = "Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents",

abstract = "Explainable reinforcement learning methods can roughly be divided into local explanations that analyze specific decisions of the agents and global explanations that convey the general strategy of the agents. In this work, we study a novel combination of local and global explanations for reinforcement learning agents. Specifically, we combine reward decomposition, a local explanation method that exposes which components of the reward function influenced a specific decision, and HIGHLIGHTS, a global explanation method that shows a summary of the agent{\textquoteright}s behavior in decisive states. Results from two user studies show significant benefits for both methods. We found that the local reward decomposition was more useful for identifying the agents{\textquoteright} priorities. However, when there was only a minor difference between the agents{\textquoteright} preferences, the global information provided by HIGHLIGHTS additionally improved participants{\textquoteright} understanding.",

keywords = "Explainable AI, Neural Networks, Reinforcement Learning",

author = "Yael Septon and Tobias Huber and Elisabeth Andr{\'e} and Ofra Amir",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.; 21st International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2023 ; Conference date: 12-07-2023 Through 14-07-2023",

year = "2023",

doi = "10.1007/978-3-031-37616-0_27",

language = "אנגלית",

isbn = "9783031376153",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "320--332",

editor = "Philippe Mathieu and Frank Dignum and Paulo Novais and {De la Prieta}, Fernando",

booktitle = "Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings",

}

Septon, Y, Huber, T, André, E & Amir, O 2023, Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents. in P Mathieu, F Dignum, P Novais & F De la Prieta (eds), Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13955 LNAI, pp. 320-332, 21st International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2023, Guimaraes, Portugal, 12/07/23. https://doi.org/10.1007/978-3-031-37616-0_27

Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents. / Septon, Yael; Huber, Tobias; André, Elisabeth et al.
Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings. ed. / Philippe Mathieu; Frank Dignum; Paulo Novais; Fernando De la Prieta. 2023. p. 320-332 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13955 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents

AU - Septon, Yael

AU - Huber, Tobias

AU - André, Elisabeth

AU - Amir, Ofra

PY - 2023

Y1 - 2023

N2 - Explainable reinforcement learning methods can roughly be divided into local explanations that analyze specific decisions of the agents and global explanations that convey the general strategy of the agents. In this work, we study a novel combination of local and global explanations for reinforcement learning agents. Specifically, we combine reward decomposition, a local explanation method that exposes which components of the reward function influenced a specific decision, and HIGHLIGHTS, a global explanation method that shows a summary of the agent’s behavior in decisive states. Results from two user studies show significant benefits for both methods. We found that the local reward decomposition was more useful for identifying the agents’ priorities. However, when there was only a minor difference between the agents’ preferences, the global information provided by HIGHLIGHTS additionally improved participants’ understanding.

AB - Explainable reinforcement learning methods can roughly be divided into local explanations that analyze specific decisions of the agents and global explanations that convey the general strategy of the agents. In this work, we study a novel combination of local and global explanations for reinforcement learning agents. Specifically, we combine reward decomposition, a local explanation method that exposes which components of the reward function influenced a specific decision, and HIGHLIGHTS, a global explanation method that shows a summary of the agent’s behavior in decisive states. Results from two user studies show significant benefits for both methods. We found that the local reward decomposition was more useful for identifying the agents’ priorities. However, when there was only a minor difference between the agents’ preferences, the global information provided by HIGHLIGHTS additionally improved participants’ understanding.

KW - Explainable AI

KW - Neural Networks

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85169004199&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-37616-0_27

DO - 10.1007/978-3-031-37616-0_27

M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???

AN - SCOPUS:85169004199

SN - 9783031376153

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 320

EP - 332

BT - Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings

A2 - Mathieu, Philippe

A2 - Dignum, Frank

A2 - Novais, Paulo

A2 - De la Prieta, Fernando

T2 - 21st International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2023

Y2 - 12 July 2023 through 14 July 2023

ER -

Septon Y, Huber T, André E, Amir O. Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents. In Mathieu P, Dignum F, Novais P, De la Prieta F, editors, Advances in Practical Applications of Agents, Multi-Agent Systems, and Cognitive Mimetics. The PAAMS Collection - 21st International Conference, PAAMS 2023, Proceedings. 2023. p. 320-332. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-37616-0_27