edoc

Learning Domain-Independent Policies for Open List Selection

Biedenkapp, André and Speck, David and Sievers, Silvan and Hutter, Frank and Lindauer, Marius and Seipp, Jendrik. (2022) Learning Domain-Independent Policies for Open List Selection. In: PRL 2022: Proceedings of the PRL Workshop - Bridging the Gap Between AI Planning and Reinforcement Learning. pp. 1-10.

[img] PDF
290Kb

Official URL: https://edoc.unibas.ch/93633/

Downloads: Statistics Overview

Abstract

Since its proposal over a decade ago, LAMA has been considered one of the best-performing satisficing classical planners. Its key component is heuristic search with multiple open lists, each using a different heuristic function to order states. Even with a very simple, ad-hoc policy for open list selection, LAMA achieves state-of-the-art results. In this paper, we propose to use dynamic algorithm configuration to learn such policies in a principled and data-driven manner. On the learning side, we show how to train a reinforcement learning agent over several heterogeneous environments, aiming at zero-shot generalization to new related domains. On the planning side, our experimental results show that the trained policies often reach the performance of LAMA, and sometimes even perform better. Furthermore, our analysis of different policies shows that prioritizing states reached via preferred operators is crucial, explaining the strong performance of LAMA.
Faculties and Departments:05 Faculty of Science > Departement Mathematik und Informatik > Informatik > Artificial Intelligence (Helmert)
UniBasel Contributors:Sievers, Silvan
Item Type:Conference or Workshop Item, refereed
Conference or workshop item Subtype:Conference Paper
Note:Publication type according to Uni Basel Research Database: Conference paper
Language:English
Related URLs:
edoc DOI:
Last Modified:13 Mar 2023 15:44
Deposited On:13 Mar 2023 15:44

Repository Staff Only: item control page