DeepInfra

Cloud PlatformOpenAI compatible

Serverless inference cloud for open-weight LLMs, image, video, TTS, transcription, embedding, and reranker models with an OpenAI-compatible API.

Active models
190
Families
67
Founded2022
API Base URL
Type
Model
L3.1-70B-Euryale-v2.2Sao10K/L3.1-70B-Euryale-v2.2
L3.3-70B-Euryale-v2.3Sao10K/L3.3-70B-Euryale-v2.3
L3-8B-Lunaris-v1-TurboSao10K/L3-8B-Lunaris-v1-Turbo
GLM-5.1OSSzai-org/GLM-5.1
GLM-5OSSzai-org/GLM-5
GLM-4.7OSSzai-org/GLM-4.7
GLM-4.7-FlashOSSzai-org/GLM-4.7-Flash
GLM-4.6OSSzai-org/GLM-4.6
Llama-4-Maverick-17B-128E-Instruct-FP8OSSmeta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
gemma-4-31B-itOSSgoogle/gemma-4-31B-it
110 of 190
1 / 19