Endpoints Team

function-calling

json-mode

Anyscale Endpoints is the first LLM APIs providing a wide range of capabilities to empower developers to build their applications not just from serving and fine tuning LLMs, but also leveraging embedding services and function calling.

- Powerful, unified platform for all your AI jobs from training to inference and fine-tuning
- Powered by Ray. Built by the Ray creators. Ray is the high-performance technology behind many of the most sophisticated AI projects in the world (OpenAI, Uber, Netflix, Spotify)
- AI App building and experimentation without the Infra and Ops headaches
- Multi-cloud and on-prem hybrid support

Get started today with Anyscale's self-service AI/ML platform:

anyscale-endpoints-llama-and-orca

Access Anyscale today to see how companies using Anyscale and Ray benefit from rapid time-to-market and faster iterations across the entire AI lifecycle.

Anyscale Endpoints: JSON Mode, Function calling, New models: Llama Guard and Mistral-7B-OpenOrca

Anyscale Preview is now available! Login today to get free $50 compute credit 🚀

Anyscale is the leading AI application platform. With Anyscale, developers can build, run and scale AI applications instantly.

Anyscale Endpoints: JSON Mode, Function calling, New models: Llama Guard and Mistral-7B-OpenOrca

LinkJSON Mode and Function Calling with Mistral 7B (public preview)

LinkModel: Llama Guard Model (Public Preview)

LinkModel: Mistral-7B-OpenOrca

Table of contents

Sharing

Sign up for product updates

Recommended content

Easily Debug Ray Applications with Ray Distributed Debugger

Processing 2 Billion Images for Stable Diffusion Model Training - Definitive Guides with Ray Series

Reducing the Cost of Pre-training Stable Diffusion by 3.7x with Anyscale