Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/11531/56114
Título : Optimising a microgrid system by deep reinforcement learning techniques
Autor : Domínguez Barbero, Claudia
García González, Javier
Sanz Bobi, Miguel Ángel
Sánchez Ubeda, Eugenio Francisco
Fecha de publicación : 1-jun-2020
Resumen : The deployment of microgrids could be fostered by control systems that do not require very complex modelling, calibration, prediction and/or optimisation processes. This paper explores the application of Reinforcement Learning (RL) techniques for the operation of a microgrid. The implemented Deep Q-Network (DQN) can learn an optimal policy for the operation of the elements of an isolated microgrid, based on the interaction agent-environment when particular operation actions are taken in the microgrid components. In order to facilitate the scaling-up of this solution, the algorithm relies exclusively on historical data from past events, and therefore it does not require forecasts of the demand or the renewable generation. The objective is to minimise the cost of operating the microgrid, including the penalty of non-served power. This paper analyses the effect of considering different definitions for the state of the system by expanding the set of variables that define it. The obtained results are very satisfactory as it can be concluded by their comparison with the perfect-information optimal operation computed with a traditional optimisation model, and with a Naive model.
The deployment of microgrids could be fostered by control systems that do not require very complex modelling, calibration, prediction and/or optimisation processes. This paper explores the application of Reinforcement Learning (RL) techniques for the operation of a microgrid. The implemented Deep Q-Network (DQN) can learn an optimal policy for the operation of the elements of an isolated microgrid, based on the interaction agent-environment when particular operation actions are taken in the microgrid components. In order to facilitate the scaling-up of this solution, the algorithm relies exclusively on historical data from past events, and therefore it does not require forecasts of the demand or the renewable generation. The objective is to minimise the cost of operating the microgrid, including the penalty of non-served power. This paper analyses the effect of considering different definitions for the state of the system by expanding the set of variables that define it. The obtained results are very satisfactory as it can be concluded by their comparison with the perfect-information optimal operation computed with a traditional optimisation model, and with a Naive model.
Descripción : Artículos en revistas
URI : https://doi.org/10.3390/en13112830
ISSN : 1996-1073
Aparece en las colecciones: Artículos

Ficheros en este ítem:
Fichero Descripción Tamaño Formato  
IIT-20-051A.pdf479,65 kBAdobe PDFVista previa
Visualizar/Abrir
IIT-20-051A479,65 kBUnknownVisualizar/Abrir
IIT-20-051A_preview2,9 kBUnknownVisualizar/Abrir
IIT-20-051A479,65 kBUnknownVisualizar/Abrir
IIT-20-051A_preview.pdf2,9 kBAdobe PDFVisualizar/Abrir


Los ítems de DSpace están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.