Ray Logo
Anyscale

MARCH 29, 2022 - 9 AM PST - FREE & VIRTUAL

Production RL Summit

The reinforcement learning event designed for practitioners

Register
hero-rl-summit
hero-rl-summit

[Tutorial] Reinforcement learning for recommender systems - from Contextual Bandits to SlateQ and Offline RL with Ray RLlib

March 29, 1:00 PM - 3:00 PM

[Requires Tutorial Pass]

Top 3 reasons to attend the RL Summit Tutorial:

  1. RL is becoming more broadly used in production, and RLlib is becoming the industry standard way of getting started and scaling out RL.
  2. This tutorial covers new bleeding edge RecSys features in RLlib, not covered in any other online material. Such as Google's Recsim env with SlateQ algorithm on PyTorch, offline RL, and how Researchers can do "fast iteration".
  3. You will have personalized instruction and interaction with Sven Mika, the tech lead for RLlib, and the rest of the RLlib team.

In this 2-hour tutorial, you will learn how to apply cutting edge reinforcement learning (RL) techniques in production with Ray RLlib.

This tutorial includes a brief introduction to provide an overview of RL concepts. The tutorial will then cover how to use Ray RLlib to train and tune contextual bandits as well as the “SlateQ” algorithm, train off offline data using cutting edge offline algorithms, and deploy RL models into a live service.

RLlib offers high scalability, a large list of algorithms to choose from (offline, model-based, model-free, etc..), support for TensorFlow and PyTorch, and a unified API for a variety of applications and customizations.

This tutorial will be for you if you are an:

  • Industry ML engineer (not necessarily with a background in RL).
  • Industry software developers (that would like to use RL to solve problems within their expert domain, but are not RL experts).
  • Industry RL engineers who would like to learn about using RLlib for the specific use cases discussed here.

Speakers

Sven Mika

Sven Mika

Machine Learning Engineer, Anyscale

Christy Bergman

Christy Bergman

Developer Advocate, Anyscale

Richard Liaw

Richard Liaw

Engineering Manager, Anyscale

Avnish Narayan

Avnish Narayan

Software Engineer, Anyscale