Optimising a microgrid system by deep reinforcement learning techniques

Domínguez Barbero, Claudia; García González, Javier; Sanz Bobi, Miguel Ángel; Sánchez Ubeda, Eugenio Francisco

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/11531/56114

Título :	Optimising a microgrid system by deep reinforcement learning techniques
Autor :	Domínguez Barbero, Claudia García González, Javier Sanz Bobi, Miguel Ángel Sánchez Ubeda, Eugenio Francisco
Fecha de publicación :	1-jun-2020
Resumen :	The deployment of microgrids could be fostered by control systems that do not require very complex modelling, calibration, prediction and/or optimisation processes. This paper explores the application of Reinforcement Learning (RL) techniques for the operation of a microgrid. The implemented Deep Q-Network (DQN) can learn an optimal policy for the operation of the elements of an isolated microgrid, based on the interaction agent-environment when particular operation actions are taken in the microgrid components. In order to facilitate the scaling-up of this solution, the algorithm relies exclusively on historical data from past events, and therefore it does not require forecasts of the demand or the renewable generation. The objective is to minimise the cost of operating the microgrid, including the penalty of non-served power. This paper analyses the effect of considering different definitions for the state of the system by expanding the set of variables that define it. The obtained results are very satisfactory as it can be concluded by their comparison with the perfect-information optimal operation computed with a traditional optimisation model, and with a Naive model. The deployment of microgrids could be fostered by control systems that do not require very complex modelling, calibration, prediction and/or optimisation processes. This paper explores the application of Reinforcement Learning (RL) techniques for the operation of a microgrid. The implemented Deep Q-Network (DQN) can learn an optimal policy for the operation of the elements of an isolated microgrid, based on the interaction agent-environment when particular operation actions are taken in the microgrid components. In order to facilitate the scaling-up of this solution, the algorithm relies exclusively on historical data from past events, and therefore it does not require forecasts of the demand or the renewable generation. The objective is to minimise the cost of operating the microgrid, including the penalty of non-served power. This paper analyses the effect of considering different definitions for the state of the system by expanding the set of variables that define it. The obtained results are very satisfactory as it can be concluded by their comparison with the perfect-information optimal operation computed with a traditional optimisation model, and with a Naive model.
Descripción :	Artículos en revistas
URI :	https://doi.org/10.3390/en13112830
ISSN :	1996-1073
Aparece en las colecciones:	Artículos

Ficheros en este ítem:

Fichero	Tamaño	Formato
IIT-20-051A.pdf	479,65 kB	Adobe PDF	Visualizar/Abrir
IIT-20-051A	479,65 kB	Unknown	Visualizar/Abrir
IIT-20-051A_preview	2,9 kB	Unknown	Visualizar/Abrir
IIT-20-051A	479,65 kB	Unknown	Visualizar/Abrir
IIT-20-051A_preview.pdf	2,9 kB	Adobe PDF	Visualizar/Abrir

Mostrar el registro Dublin Core completo del ítem