Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach

Delgado, Tomás; Sánchez Sorondo, Marco; Braberman, Víctor; Uchitel, Sebastián

Título :	Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach
Autor(es) :	Delgado, Tomás Sánchez Sorondo, Marco Braberman, Víctor Uchitel, Sebastián
Fecha de publicación :	8-jul-2023
Tipo de publicación:	Documento de conferencia
Versión:	Enviado
Publicado en:	33rd International Conference on Automated Planning and Scheduling. Prague, Czech Republic. 2023
Areas del conocimiento :	Ciencias Naturales y Exactas Ciencias de la Computación e Información Ciencias de la Computación
Otros descriptores :	Artificial intelligence Controller synthesis
Resumen :	In this work, we propose a new method for obtaining heuristics based on Reinforcement Learning (RL). The synthesis algorithm is thus framed as an RL task with an unbounded action space and a modified version of DQN is used. With a simple and general set of features that abstracts both states and actions, we show that it is possible to learn heuristics on small versions of a problem that generalize to the larger instances, effectively doing zero-shot policy transfer. Our agents learn from scratch in a highly partially observable RL task and outperform the existing heuristic overall, in instances unseen during training.
URI / Handle:	https://hdl.handle.net/20.500.12381/3418
Recursos relacionados en REDI:	https://hdl.handle.net/20.500.12381/3417 https://hdl.handle.net/20.500.12381/3419 https://hdl.handle.net/20.500.12381/3420
DOI:	https://doi.org/10.48550/arXiv.2210.05393
Institución responsable del proyecto:	Universidad de Buenos Aires.
Financiadores:	Agencia Nacional de Promoción de la Investigación, el Desarrollo Tecnológico y la Innovación Universidad de Buenos Aires Agencia Nacional de Investigación e Innovación
Identificador ANII:	IA_1_2022_1_173516
Nivel de Acceso:	Acceso abierto
Licencia CC:	Reconocimiento 4.0 Internacional. (CC BY)
Aparece en las colecciones:	Publicaciones de ANII

Archivos en este ítem:

archivo		Descripción	Tamaño	Formato
2210.05393v2.pdf	Descargar		1.33 MB	Adobe PDF

Las obras en REDI están protegidas por licencias Creative Commons.
Por más información sobre los términos de esta publicación, visita: Reconocimiento 4.0 Internacional. (CC BY)

Mostrar el registro Dublin Core completo del ítem