Serverless inference cloud for open-weight LLMs, image, video, TTS, transcription, embedding, and reranker models with an OpenAI-compatible API.
| Model |
|---|
L3.1-70B-Euryale-v2.2Sao10K/L3.1-70B-Euryale-v2.2 |
L3.3-70B-Euryale-v2.3Sao10K/L3.3-70B-Euryale-v2.3 |
L3-8B-Lunaris-v1-TurboSao10K/L3-8B-Lunaris-v1-Turbo |
GLM-5.1OSSzai-org/GLM-5.1 |
GLM-5OSSzai-org/GLM-5 |
GLM-4.7OSSzai-org/GLM-4.7 |
GLM-4.7-FlashOSSzai-org/GLM-4.7-Flash |
GLM-4.6OSSzai-org/GLM-4.6 |
Llama-4-Maverick-17B-128E-Instruct-FP8OSSmeta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 |
gemma-4-31B-itOSSgoogle/gemma-4-31B-it |