Multifidelity Bayesian optimization for hyperparameter tuning of deep reinforcement learning algorithms

Garrido Merchán, Eduardo César

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/11531/101079

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Garrido Merchán, Eduardo César	es-ES
dc.date.accessioned	2025-07-15T11:08:06Z	-
dc.date.available	2025-07-15T11:08:06Z	-
dc.date.issued	2025-12-31	es_ES
dc.identifier.issn	3029-2786	es_ES
dc.identifier.uri	https:doi.org10.59400cai2923	es_ES
dc.description	Artículos en revistas	es_ES
dc.description.abstract		es-ES
dc.description.abstract	This research focuses on comparing standard Bayesian optimization and multifidelity Bayesian optimization in the hyperparameter search to improve the performance of reinforcement learning algorithms in environments such as OpenAI LunarLander and CartPole. The primary goal is to determine whether multifidelity Bayesian optimization provides significant improvements in solution quality compared to standard Bayesian optimization. To address this question, several Python implementations were developed, evaluating the solution quality using the mean of the total rewards obtained as the objective function. Various experiments were conducted for each environment and version using different seeds, ensuring that the results were not merely due to the inherent randomness of reinforcement learning algorithms. The results demonstrate that multifidelity Bayesian optimization outperforms standard Bayesian optimization in several key aspects. In the LunarLander environment, multifidelity optimization achieved better convergence and more stable performance, yielding a higher average reward compared to the standard version. In the CartPole environment, although both methods quickly reached the maximum reward, multifidelity did so with greater consistency and in less time. These findings highlight the ability of multifidelity optimization to optimize hyperparameters more efficiently, using fewer resources and less time while achieving superior performance.	en-GB
dc.language.iso	en-GB	es_ES
dc.source	Revista: Computing and Artificial Intelligence, Periodo: 1, Volumen: online, Número: 2, Página inicial: 2923-1, Página final: 2923-13	es_ES
dc.subject.other	Instituto de Investigación Tecnológica (IIT)	es_ES
dc.title	Multifidelity Bayesian optimization for hyperparameter tuning of deep reinforcement learning algorithms	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.description.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.rights.holder		es_ES
dc.rights.accessRights	info:eu-repo/semantics/openAccess	es_ES
dc.keywords		es-ES
dc.keywords	deep reinforcement learning; bayesian optimization; meta learning	en-GB
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Descripción	Tamaño	Formato
IIT-25-153R		409,22 kB	Unknown	Visualizar/Abrir
IIT-25-153R_preview		3,32 kB	Unknown	Visualizar/Abrir

Mostrar el registro sencillo del ítem