Ultra-fast inference platform powered by custom LPU hardware for low-latency AI.
| Model |
|---|
Llama 4 Scout 17B 16EOSSmeta-llama/llama-4-scout-17b-16e-instruct |
Llama 3.3 70Bllama-3.3-70b-versatile |
Llama 3.1 8Bllama-3.1-8b-instant |
Qwen3-32Bqwen/qwen3-32b |
Kimi K2 0905moonshotai/kimi-k2-instruct-0905 |
GPT OSS 120BOSSopenai/gpt-oss-120b |
GPT OSS 20BOSSopenai/gpt-oss-20b |
Compoundgroq/compound |
Compound Minigroq/compound-mini |
Whisper Large V3 Turbowhisper-large-v3-turbo |