Cerebras

Cloud PlatformOpenAI compatible

Ultra-fast inference powered by wafer-scale compute chips, offering the fastest LLM speeds.

Active models
2+ 14 deprecated
Families
5
Founded2016
API Base URL
Model
Qwen 3 235B InstructOSSqwen-3-235b-a22b-instruct
OpenAI GPT OSSOSSgpt-oss-120b