{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "Copyright (c) Microsoft Corporation. All rights reserved.\n", "\n", "Licensed under the MIT License." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Reinforcement Learning in Azure Machine Learning - Pong problem\n", "Reinforcement Learning in Azure Machine Learning is a managed service for running distributed reinforcement learning training and simulation using the open source Ray framework.\n", "This noteboook demonstrates how to use Ray to solve a more complex problem using a more complex setup including Ray RLLib running on multiple compute nodes and using a GPU.\n", "For this example we will train a Pong playing agent on cluster of two NC6 nodes (6 CPU, 1 GPU).\n", "\n", "## Pong problem\n", "[Pong](https://en.wikipedia.org/wiki/Pong) is a two-dimensional sports game that simulates table tennis. The player controls an in-game paddle by moving it vertically across the left or right side of the screen. They can compete against another player controlling a second paddle on the opposing side. Players use the paddles to hit a ball back and forth." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
![]() | \n",
"
|---|
| Fig 1. Pong game animation (from towardsdatascience.com). | \n", "