HomeEventsHuman-in-the-Loop Reinforcement Learning

Ray Summit

Human-in-the-Loop Reinforcement Learning

Pieter Abbeel, Professor, UC Berkeley | Founder, Covariant | Host, The Robot Brains Podcast

View Slides >>>

Deep reinforcement learning (Deep RL) has seen many successes, including learning to play Atari games, the classical game of Go, robotic locomotion and manipulation. However, now that Deep RL has become fairly capable of optimizing reward, a new challenge has arisen: How to choose the reward function that is to be optimized? Indeed, this often becomes the key engineering time sink for practitioners. In this talk, I will present some recent progress on human-in-the-loop reinforcement learning. The newly proposed algorithm, PEBBLE, empowers a human supervisor to directly teach an AI agent new skills without the usual extensive reward engineering or curriculum design efforts.

Speakers

Pieter Abbeel

Pieter Abbeel

Professor, UC Berkeley | Founder, Covariant | Host, The Robot Brains Podcast, UC Berkeley | Covariant | The Robot Brains

Other Events

Ray Summit 2026

08 . 24 . 2026  ,  07:00 AM (PST)

Ray Summit 2024

09 . 30 . 2024  ,  03:00 PM (PST)

Ray Summit 2023

09 . 18 . 2023  ,  03:30 PM (PST)