HomeEventsBuilding a scalable ML model serving API with Ray Serve

Webinar

Building a scalable ML model serving API with Ray Serve

The demo will show how to:
- Deploy a trained Python model and scale it to a cluster using Ray Serve
- Improve the HTTP API using Ray Serve’s native FastAPI integration
- Compose multiple independently-scalable models into a single model, and run them in parallel to minimize latency.

LinkView slides >>>

Speakers

Tricia Fu

Tricia Fu

Product Manager, Anyscale, Anyscale

Other Events

Anyscale on Azure: Build and deploy AI at scale in your own tenant

06 . 16 . 2026  ,  03:30 PM (PST)

How Torc Robotics Scales Multimodal AI for Autonomous Driving with Ray

06 . 10 . 2026  ,  03:30 PM (PST)

Building a Multimodal Video Processing Pipeline with Ray

05 . 28 . 2026  ,  03:30 PM (PST)