Cerebras

Cloud PlatformOpenAI compatible

Ultra-fast inference powered by wafer-scale compute chips, offering the fastest LLM speeds.

Active models
3+ 12 deprecated
Families
5
Founded2016
API Base URL
Model
Qwen 3 235B InstructOSSqwen-3-235b-a22b-instruct
Z.ai GLM 4.7zai-glm-4.7
OpenAI GPT OSSOSSgpt-oss-120b