Difference between revisions of "Applications of Reinforcement Learning in Finance"
Line 17: | Line 17: | ||
== Important links == | == Important links == | ||
* [https://www.overleaf.com/read/pcpfrwsvyhqn Overleaf] | * [https://www.overleaf.com/read/pcpfrwsvyhqn Overleaf] | ||
− | * [https://github. | + | * [https://github.com/frensizej/RL_DDQN.git GitHub] |
* [https://drive.google.com/drive/folders/1E2sGajWczI4jgBBEKfqNORDO1oWQv3M2 Google Drive with paper and links] | * [https://drive.google.com/drive/folders/1E2sGajWczI4jgBBEKfqNORDO1oWQv3M2 Google Drive with paper and links] | ||
Revision as of 22:30, 23 June 2022
Details
- Authors: Frensi Zejnullahu, Maurice Moser
- Title: Applications of Reinforcement Learning in Finance - Trading with a Double Deep Q-Network.
- Supervisior: Prof. Dr. Jörg Osterrieder
- Degree: Bachelor of Science
- University: Zurich University of Applied Sciences
- Year: 2022
- Status: Working Paper
Summary
Examining if additional input features in a feedforward neural network improve the prediction accuracy for daily Bitcoin price movements and if the Bitcoin market is efficient as per the weak form of the efficient market hypothesis.
Abstract
This paper presents a Double Deep Q-Network algorithm for trading single assets, namely the E-mini S&P 500 continuous futures contract. We use a proven setup as the foundation for our environment with multiple extensions. The features of our trading agent are constantly being expanded to include additional assets such as commodities, resulting in four models. We also respond to environmental conditions, including costs and crises. Our trading agent is first trained for a specific time period and tested on new data and compared with the long-and-hold strategy as a benchmark (market). We analyze the differences between the various models and the in-sample/out-of-sample performance with respect to the environment. The experimental results show that the trading agent follows an appropriate behavior. It can adjust its policy to different circumstances, such as more extensive use of the neutral position when trading costs are present. Furthermore, the net asset value exceeded that of the benchmark, and the agent outperformed the market in the test set. We provide initial insights into the behavior of an agent in a financial domain using a DDQN algorithm. The results of this study can be used for further development.
Important links
Data
- Bitcoin (BTC-USD)
- S&P 500 (ˆGSPC)
- Russell 2000 (ˆRUT)
- EUR/USD (EURUSD=X)
- 10 Year Treasury Note Yield (^TNX)
- PHLX Gold/Silver Sector Index (^XAU)
Data source: Yahoo Finance