Deep reinforcement learning for automated stock trading: An ensemble strategy

H Yang, XY Liu, S Zhong, A Walid - Proceedings of the first ACM …, 2020 - dl.acm.org
Proceedings of the first ACM international conference on AI in finance, 2020dl.acm.org
Stock trading strategies play a critical role in investment. However, it is challenging to design
a profitable strategy in a complex and dynamic stock market. In this paper, we propose an
ensemble strategy that employs deep reinforcement schemes to learn a stock trading
strategy by maximizing investment return. We train a deep reinforcement learning agent and
obtain an ensemble trading strategy using three actor-critic based algorithms: Proximal
Policy Optimization (PPO), Advantage Actor Critic (A2C), and Deep Deterministic Policy …
Stock trading strategies play a critical role in investment. However, it is challenging to design a profitable strategy in a complex and dynamic stock market. In this paper, we propose an ensemble strategy that employs deep reinforcement schemes to learn a stock trading strategy by maximizing investment return. We train a deep reinforcement learning agent and obtain an ensemble trading strategy using three actor-critic based algorithms: Proximal Policy Optimization (PPO), Advantage Actor Critic (A2C), and Deep Deterministic Policy Gradient (DDPG). The ensemble strategy inherits and integrates the best features of the three algorithms, thereby robustly adjusting to different market situations. In order to avoid the large memory consumption in training networks with continuous action space, we employ a load-on-demand technique for processing very large data. We test our algorithms on the 30 Dow Jones stocks that have adequate liquidity. The performance of the trading agent with different reinforcement learning algorithms is evaluated and compared with both the Dow Jones Industrial Average index and the traditional min-variance portfolio allocation strategy. The proposed deep ensemble strategy is shown to outperform the three individual algorithms and two baselines in terms of the risk-adjusted return measured by the Sharpe ratio.
ACM Digital Library
以上显示的是最相近的搜索结果。 查看全部搜索结果