Por favor, use este identificador para citar o enlazar este ítem:
http://hdl.handle.net/11531/104037Registro completo de metadatos
| Campo DC | Valor | Lengua/Idioma |
|---|---|---|
| dc.contributor.author | de la Rica Escudero, Alejandra | es-ES |
| dc.contributor.author | Garrido Merchán, Eduardo César | es-ES |
| dc.contributor.author | Coronado Vaca, María | es-ES |
| dc.date.accessioned | 2025-09-15T08:04:55Z | - |
| dc.date.available | 2025-09-15T08:04:55Z | - |
| dc.date.issued | 2025-01-16 | es_ES |
| dc.identifier.issn | 1932-6203 | es_ES |
| dc.identifier.uri | https://doi.org/10.1371/journal.pone.0315528 | es_ES |
| dc.identifier.uri | http://hdl.handle.net/11531/104037 | - |
| dc.description | Artículos en revistas | es_ES |
| dc.description.abstract | El artículo propone un enfoque novedoso para dotar de interpretabilidad al Aprendizaje por Refuerzo Profundo (DRL) aplicado a la gestión de carteras financieras. A diferencia de los modelos clásicos, como la teoría de Markowitz, poco fiables en mercados volátiles, los agentes DRL ofrecen soluciones más flexibles, pero opacas en su funcionamiento. Para superar esta limitación, los autores desarrollan un marco de Aprendizaje por Refuerzo Profundo Explicable (XDRL) que integra el algoritmo Proximal Policy Optimization (PPO) con técnicas de interpretación post hoc como SHAP, LIME y análisis de importancia de características. A través de experimentos con acciones tecnológicas, se muestra cómo estas herramientas permiten identificar las variables clave que influyen en las decisiones de inversión del agente en tiempo real. El trabajo constituye la primera propuesta de política financiera post hoc explicable basada en DRL, con implicaciones prácticas para aumentar la confianza de inversores y reguladores. | es-ES |
| dc.description.abstract | Financial portfolio management investment policies computed quantitatively by modern portfolio theory techniques like the Markowitz model rely on a set of assumptions that are not supported by data in high volatility markets such as the technological sector or cryptocurrencies. Hence, quantitative researchers are looking for alternative models to tackle this problem. Concretely, portfolio management (PM) is a problem that has been successfully addressed recently by Deep Reinforcement Learning (DRL) approaches. In particular, DRL algorithms train an agent by estimating the distribution of the expected reward of every action performed by an agent given any financial state in a simulator, also called gymnasium. However, these methods rely on Deep Neural Networks model to represent such a distribution, that although they are universal approximator models, capable of representing this distribution over time, they cannot explain its behaviour, given by a set of parameters that are not interpretable. Critically, financial investors policies require predictions to be interpretable, to assess whether they follow a reasonable behaviour, so DRL agents are not suited to follow a particular policy or explain their actions. In this work, driven by the motivation of making DRL explainable, we developed a novel Explainable DRL (XDRL) approach for PM, integrating the Proximal Policy Optimization (PPO) DRL algorithm with the model agnostic explainable machine learning techniques of feature importance, SHAP and LIME to enhance transparency in prediction time. By executing our methodology, we can interpret in prediction time the actions of the agent to assess whether they follow the requisites of an investment policy or to assess the risk of following the agent’s suggestions. We empirically illustrate it by successfully identifying key features influencing investment decisions, which demonstrate the ability to explain the agent actions in prediction time. We propose the first explainable post hoc PM financial policy of a DRL agent. | en-GB |
| dc.format.mimetype | application/pdf | es_ES |
| dc.language.iso | es-ES | es_ES |
| dc.rights | Creative Commons Reconocimiento-NoComercial-SinObraDerivada España | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ | es_ES |
| dc.source | Revista: PLoS One, Periodo: 1, Volumen: ., Número: ., Página inicial: ., Página final: . | es_ES |
| dc.title | Explainable post hoc portfolio management financial policy of a Deep Reinforcement Learning agent | es_ES |
| dc.type | info:eu-repo/semantics/article | es_ES |
| dc.description.version | info:eu-repo/semantics/publishedVersion | es_ES |
| dc.rights.holder | es_ES | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | es_ES |
| dc.keywords | Aprendizaje por Refuerzo Profundo; explicabilidad; gestión de carteras; inteligencia artificial explicable; SHAP; LIME; importancia de características; mercados financieros. | es-ES |
| dc.keywords | Deep Reinforcement Learning; explainability; portfolio management; explainable artificial intelligence; SHAP; LIME; feature importance; financial markets. | en-GB |
| Aparece en las colecciones: | Artículos | |
Ficheros en este ítem:
| Fichero | Tamaño | Formato | |
|---|---|---|---|
| journal.pone.0315528.pdf | 1,35 MB | Adobe PDF | Visualizar/Abrir |
Los ítems de DSpace están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.