Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach

Delgado, Tomás; Sánchez Sorondo, Marco; Braberman, Víctor; Uchitel, Sebastián

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.rights.license	Reconocimiento 4.0 Internacional. (CC BY)	-
dc.contributor.author	Delgado, Tomás	es
dc.contributor.author	Sánchez Sorondo, Marco	es
dc.contributor.author	Braberman, Víctor	es
dc.contributor.author	Uchitel, Sebastián	es
dc.date.accessioned	2024-02-16T13:30:44Z	-
dc.date.available	2024-02-16T13:30:44Z	-
dc.date.issued	2023-07-08	-
dc.identifier.uri	https://hdl.handle.net/20.500.12381/3418	-
dc.description.abstract	In this work, we propose a new method for obtaining heuristics based on Reinforcement Learning (RL). The synthesis algorithm is thus framed as an RL task with an unbounded action space and a modified version of DQN is used. With a simple and general set of features that abstracts both states and actions, we show that it is possible to learn heuristics on small versions of a problem that generalize to the larger instances, effectively doing zero-shot policy transfer. Our agents learn from scratch in a highly partially observable RL task and outperform the existing heuristic overall, in instances unseen during training.	es
dc.description.sponsorship	Agencia Nacional de Promoción de la Investigación, el Desarrollo Tecnológico y la Innovación	es
dc.description.sponsorship	Universidad de Buenos Aires	es
dc.description.sponsorship	Agencia Nacional de Investigación e Innovación	es
dc.language.iso	eng	es
dc.relation.uri	https://hdl.handle.net/20.500.12381/3417	es
dc.relation.uri	https://hdl.handle.net/20.500.12381/3419	-
dc.relation.uri	https://hdl.handle.net/20.500.12381/3420	-
dc.rights	Acceso abierto	*
dc.source	33rd International Conference on Automated Planning and Scheduling. Prague, Czech Republic. 2023	es
dc.subject	Artificial intelligence	es
dc.subject	Controller synthesis	es
dc.title	Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning Approach	es
dc.type	Documento de conferencia	es
dc.subject.anii	Ciencias Naturales y Exactas	-
dc.subject.anii	Ciencias de la Computación e Información	-
dc.subject.anii	Ciencias de la Computación	-
dc.identifier.anii	IA_1_2022_1_173516	es
dc.type.version	Enviado	es
dc.identifier.doi	https://doi.org/10.48550/arXiv.2210.05393	-
dc.anii.institucionresponsable	Universidad de Buenos Aires.	es
dc.anii.subjectcompleto	//Ciencias Naturales y Exactas/Ciencias de la Computación e Información/Ciencias de la Computación	es
Aparece en las colecciones:	Publicaciones de ANII

Archivos en este ítem:

archivo		Descripción	Tamaño	Formato
2210.05393v2.pdf	Descargar		1.33 MB	Adobe PDF

Las obras en REDI están protegidas por licencias Creative Commons.
Por más información sobre los términos de esta publicación, visita: Reconocimiento 4.0 Internacional. (CC BY)

Mostrar el registro sencillo del ítem