Training an Agent using Deep Reinforcement Learning: Snake Game

Kartik Kaushik; Reetej Chindarkar; Rutuja Vetal; Ronak Thusoo; Prof. Pallavi Shimpi

doi:10.32628/CSEIT218344

Authors

Kartik Kaushik, Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
Reetej Chindarkar, Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
Rutuja Vetal, Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
Ronak Thusoo, Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
Prof. Pallavi Shimpi, Assistant Professor, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Lohegaon, Pune, Maharashtra, India

Keywords:

Deep reinforcement learning, Q-Learning, Deep Neural Network, Deep Learning, Experience replay.

Abstract

Deep reinforcement learning has become a commonly adopted method for enabling agents to learn complex control policies in video games; DeepMind used this technique to play Atari games. However, such approaches need to be improved when applied to harder scenarios in which reward signals are sparse and delayed. This paper presents a refined deep reinforcement learning model that enables an autonomous agent to play the classic Snake Game, whose constraints become stricter as the game progresses. Specifically, we train a Deep Neural Network (DNN) with a variant of Q-learning in which the agent learns from its past experiences. We also propose a carefully designed reward mechanism to train the network properly, adopt a training gap strategy that temporarily bypasses training after the target's situation changes, and introduce a dual experience replay method that categorizes experiences for better training. The results show that our agent outperforms the baseline model and surpasses human-level performance at the Snake Game.
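
The abstract does not say how experiences are categorized or how the two replay pools are sampled, so the following Python sketch is only one plausible reading of a dual experience replay, not the authors' implementation. It assumes transitions are split by a hypothetical `reward_threshold` on the absolute reward and that each batch draws a fixed `valuable_fraction` from the high-reward pool. The names `Transition`, `DualReplayBuffer`, and `q_target` are illustrative. `q_target` shows the standard one-step Q-learning target that such a buffer would feed.

```python
import random
from collections import deque
from typing import Deque, List, NamedTuple, Sequence, Tuple


class Transition(NamedTuple):
    """One step of agent experience (field layout is an assumption)."""
    state: Tuple[float, ...]
    action: int
    reward: float
    next_state: Tuple[float, ...]
    done: bool


class DualReplayBuffer:
    """Two bounded pools: high-|reward| transitions and all others."""

    def __init__(self, capacity: int, reward_threshold: float = 0.5) -> None:
        # Hypothetical split rule; the source does not specify one.
        self.reward_threshold = reward_threshold
        self.valuable: Deque[Transition] = deque(maxlen=capacity)
        self.ordinary: Deque[Transition] = deque(maxlen=capacity)

    def add(self, t: Transition) -> None:
        # Route the transition to a pool based on reward magnitude.
        if abs(t.reward) >= self.reward_threshold:
            self.valuable.append(t)
        else:
            self.ordinary.append(t)

    def sample(self, batch_size: int, valuable_fraction: float,
               rng: random.Random) -> List[Transition]:
        # Take up to the requested share from the valuable pool,
        # then fill the remainder from the ordinary pool.
        n_val = min(int(batch_size * valuable_fraction), len(self.valuable))
        n_ord = min(batch_size - n_val, len(self.ordinary))
        return (rng.sample(list(self.valuable), n_val)
                + rng.sample(list(self.ordinary), n_ord))


def q_target(reward: float, next_q_values: Sequence[float],
             done: bool, gamma: float = 0.99) -> float:
    """Standard one-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a training loop, each sampled transition's `q_target` would serve as the regression target for the network's output at the taken action. The value of `valuable_fraction` could be lowered over time if rare high-reward experiences become less informative, though the abstract gives no schedule.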

