Home Events[Ray Meetup] Ray + vLLM in Action: Lessons from Pinterest and Deepseek Deployments

Ray Meetup

[Ray Meetup] Ray + vLLM in Action: Lessons from Pinterest and Deepseek Deployments

Tuesday, June 10 6 PM PDT

Join us for our next Ray Meetup where we’ll explore batch inference at scale with Ray and vLLM! Learn how Pinterest scales batch inference using Ray, and get a first look at Anyscale’s latest tools—Ray Serve and Data LLM—for orchestrating large-scale LLM inference. We’ll cover topics like batch inference, prefill-decode disaggregation, DP/EP parallelism, and custom request routing.

📆 Tuesday, June 10th, 2025

🕔 5:00pm

📌 55 Hawthorne St, San Francisco

Speakers:
Chia-Wei Chen, Software Engineer, ML Training Infra, Pinterest
Kourosh Hakhamaneshi, AI Lead, Anyscale

Learn more and register here.