作者
Hongyang Yang, Xiao-Yang Liu, Shan Zhong, Anwar Walid
发表日期
2020/10/15
图书
Proceedings of the first ACM international conference on AI in finance
页码范围
1-8
简介
Stock trading strategies play a critical role in investment. However, it is challenging to design a profitable strategy in a complex and dynamic stock market. In this paper, we propose an ensemble strategy that employs deep reinforcement schemes to learn a stock trading strategy by maximizing investment return. We train a deep reinforcement learning agent and obtain an ensemble trading strategy using three actor-critic based algorithms: Proximal Policy Optimization (PPO), Advantage Actor Critic (A2C), and Deep Deterministic Policy Gradient (DDPG). The ensemble strategy inherits and integrates the best features of the three algorithms, thereby robustly adjusting to different market situations. In order to avoid the large memory consumption in training networks with continuous action space, we employ a load-on-demand technique for processing very large data. We test our algorithms on the 30 Dow Jones stocks …
引用总数
20202021202220232024431598348
学术搜索中的文章
H Yang, XY Liu, S Zhong, A Walid - Proceedings of the first ACM international conference …, 2020