Ray Summit

DeepCoMP: Multi-agent Reinforcement Learning for Multi-cell Selection in 5G and Beyond

Tuesday, June 22, 9:20PM UTC

Stefan Schneider, PhD Candidate & Researcher, Paderborn University

We present DeepCoMP as the outcome of a research project on dynamic multi-cell selection in future mobile networks. DeepCoMP is a (multi-agent) deep reinforcement learning approach, built with Ray RLlib, that continuously coordinates user-cell connections in mobile networks. Connecting to and receiving data from multiple cells simultaneously using coordinated multipoint (CoMP) can greatly increase the received data rate and is crucial for AR/VR, smart manufacturing, cloud gaming, and vehicular networking scenarios in 5G and beyond. Selecting how many and which cells should serve each user is challenging because users compete for limited radio resources and the channel state changes continuously as users move around.

Existing approaches typically build on expert-tailored models and require strict assumptions or perfect knowledge of the underlying radio system and environment dynamics, which are often unavailable in practice. In contrast, DeepCoMP makes very few built-in assumptions and learns to control multi-cell selection solely from partial, realistically available observations and its own experience. We present three variants of DeepCoMP that use either centralized or distributed multi-agent deep reinforcement learning, discuss their strengths and weaknesses, and show that DeepCoMP outperforms other approaches by up to 231%. We also show how we used Ray RLlib to implement DeepCoMP and how RLlib simplified switching between centralized and multi-agent RL, as well as local development and the deployment of experiments on a private cluster.


Stefan Schneider

PhD Candidate & Researcher, Paderborn University

Stefan Schneider is pursuing his Ph.D. in computer science at Paderborn University, Germany, where he works as a research associate in the university’s computer networks group. His main research interests are management and optimization in networking and cloud computing, particularly in combination with machine learning. He has (co-)authored over 20 papers at international conferences and in journals, received two awards for these publications, worked on multiple large research projects with academic and industry partners, and led his own research project (RealVNF) from 2018 to 2021.