Groq

Cloud PlatformOpenAI compatible

Ultra-fast inference platform powered by custom LPU hardware for low-latency AI.

Active models
16
Families
8
Founded2016
API Base URL
Type
Model
Llama 4 Scout 17B 16EOSSmeta-llama/llama-4-scout-17b-16e-instruct
Llama 3.3 70Bllama-3.3-70b-versatile
Llama 3.1 8Bllama-3.1-8b-instant
Qwen3-32Bqwen/qwen3-32b
Kimi K2 0905moonshotai/kimi-k2-instruct-0905
GPT OSS 120BOSSopenai/gpt-oss-120b
GPT OSS 20BOSSopenai/gpt-oss-20b
Compoundgroq/compound
Compound Minigroq/compound-mini
Whisper Large V3 Turbowhisper-large-v3-turbo
1–10 of 16
1 / 2