In this 2-hour tutorial, you will learn how to apply cutting-edge reinforcement learning (RL) techniques in production with Ray RLlib.

The tutorial opens with a brief introduction to RL concepts. It then covers how to use Ray RLlib to train and tune contextual bandits as well as the “SlateQ” algorithm, how to train on offline data using cutting-edge offline RL algorithms, and how to deploy RL models into a live service.

RLlib offers high scalability, a large list of algorithms to choose from (offline, model-based, model-free, etc.), support for TensorFlow and PyTorch, and a unified API for a variety of applications and customizations.

This tutorial is for you if you are an:
Industry ML engineer (not necessarily with a background in RL).
Industry software developer (who would like to use RL to solve problems within their domain of expertise, but is not an RL expert).
Industry RL engineer who would like to learn about using RLlib for the specific use cases discussed here.