Utilization of Supervised and Reinforcement Learning in the Automation of the Classical Atari Game "Pong"
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The classical "Pong" game resembles 2-player table tennis and was developed by Atari in 1972. In this game, each player controls a small paddle to bounce a ball across the rectangular area to defend a small goal on either side of the arena. Pong became extremely popular and is generally considered to be the first commercially successful video game. It has even earned itself a spot in the Smithsonian Institution in Washington, D.C. because of its level of cultural impact. This research project was an attempt to automate Pong through the use of two radically different machine learning methods, supervised learning and reinforcement learning. In supervised learning, an expert provides the training data, which consists of example input-output pairs to be used for the learning. In our case, a human controlled one paddle in response to the ball, while the other paddle moved up and down at a defined rate of 2.64 seconds per cycle. Then, an artificial neural network with four layers was trained from this dataset. In reinforcement learning, by using a reward system developed to encourage the paddle to defend the goal by bouncing the ball and receiving points, a controller agent was trained using Deep Q-Neural Networks (DQN). This method allows the computer to teach itself through trial and error. The supervised learning method generated an automated paddle that was deemed unbeatable by several challengers; while the reinforcement learning method was only capable of producing a controller agent with an average of 3-5 ball bounces per episode.