Training an Agent using Deep Reinforcement Learning: Snake Game

Authors

  • Kartik Kaushik  Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
  • Reetej Chindarkar  Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
  • Rutuja Vetal  Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
  • Ronak Thusoo  Student, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Pune, Maharashtra, India
  • Prof. Pallavi Shimpi  Assistant professor, Department of Computer Engineering, Dr. D. Y. Patil School of Engineering, Lohegaon, Pune, Maharashtra, India

Keywords:

Deep reinforcement learning, Q-Learning, Deep Neural Network, Deep Learning, Experience replay.

Abstract

Deep Reinforcement Learning has become a commonly adopted method to enable agents to hunt out complex control policies in various video games. Deep-Mind used this technique to play Atari games. However, similar approaches should get to be improved when applied to tougher scenarios, where reward signals are sparse and delayed. This paper illustrates a refined Deep Reinforcement Learning model to enable an autonomous agent to play the classical Snake Game, whose constraints get stricter as the game progresses further. Specifically, to train this model we have used Deep Neural Network (DNN) with a variant of Q-learning where agent will learn from its past experiences. Moreover, we have proposed a designed reward mechanism to properly train the network, adopt a training gap strategy to temporarily bypass training after the situation of the target changes, and also introduces dual experience replay method through which different experiences for better training can be categorized. The final results show that our agent in an environment outperforms the baseline model and surpasses the human-level performance in terms of playing the Snake Game.

References

  1. Chathurangi Shyalika, “A Beginners Guide to Q-Learning,” Towards Data Science, 15 Nov, 2019.
  2. V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller, “Playing atari with deep reinforcement learning,” ArXiv e-prints, 2013.
  3. Chris Nicholson, “A Beginner's Guide to Deep Reinforcement Learning,” Path Mind.
  4. L.-J. Lin, “Reinforcement learning for robots using neural networks,” Ph.D. dissertation, Pittsburgh, PA, USA, 1992, UMI Order No. GAX93-22750.
  5. R. S. Sutton and A. G. Barto, “Reinforcement learning: An introduction,” IEEE Transactions on Neural Networks, vol. 9, no. 5, pp. 1054–1054, 1998.
  6. V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness,M. G. Bellemare, A. Graves, M. A. Riedmiller, A. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015.
  7. D. Wang and A.-H. Tan, “Creating autonomous adaptive agents in a real-time first-person shooter computer game,” IEEE Transactions on Computational Intelligence and AI in Games, vol. 7, no. 2, pp. 123–138, 2015.
  8. H. Y. Ong, K. Chavez, and A. Hong, “Distributed deep Q-learning,” ArXiv e-prints, 2015.

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Kartik Kaushik, Reetej Chindarkar, Rutuja Vetal, Ronak Thusoo, Prof. Pallavi Shimpi, " Training an Agent using Deep Reinforcement Learning: Snake Game" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 8, Issue 3, pp.279-286, May-June-2021.