Training an Agent using Deep Reinforcement Learning: Snake Game
Keywords:
Deep reinforcement learning, Q-Learning, Deep Neural Network, Deep Learning, Experience replay.Abstract
Deep Reinforcement Learning has become a commonly adopted method to enable agents to hunt out complex control policies in various video games. Deep-Mind used this technique to play Atari games. However, similar approaches should get to be improved when applied to tougher scenarios, where reward signals are sparse and delayed. This paper illustrates a refined Deep Reinforcement Learning model to enable an autonomous agent to play the classical Snake Game, whose constraints get stricter as the game progresses further. Specifically, to train this model we have used Deep Neural Network (DNN) with a variant of Q-learning where agent will learn from its past experiences. Moreover, we have proposed a designed reward mechanism to properly train the network, adopt a training gap strategy to temporarily bypass training after the situation of the target changes, and also introduces dual experience replay method through which different experiences for better training can be categorized. The final results show that our agent in an environment outperforms the baseline model and surpasses the human-level performance in terms of playing the Snake Game.
References
- Chathurangi Shyalika, “A Beginners Guide to Q-Learning,” Towards Data Science, 15 Nov, 2019.
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller, “Playing atari with deep reinforcement learning,” ArXiv e-prints, 2013.
- Chris Nicholson, “A Beginner's Guide to Deep Reinforcement Learning,” Path Mind.
- L.-J. Lin, “Reinforcement learning for robots using neural networks,” Ph.D. dissertation, Pittsburgh, PA, USA, 1992, UMI Order No. GAX93-22750.
- R. S. Sutton and A. G. Barto, “Reinforcement learning: An introduction,” IEEE Transactions on Neural Networks, vol. 9, no. 5, pp. 1054–1054, 1998.
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness,M. G. Bellemare, A. Graves, M. A. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015.
- D. Wang and A.-H. Tan, “Creating autonomous adaptive agents in a real-time first-person shooter computer game,” IEEE Transactions on Computational Intelligence and AI in Games, vol. 7, no. 2, pp. 123–138, 2015.
- H. Y. Ong, K. Chavez, and A. Hong, “Distributed deep Q-learning,” ArXiv e-prints, 2015.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.