Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is best for high-volume agentic tasks, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.
| Model | Context | Max out | Pricing |
|---|---|---|---|
| google/gemini-3.1-flash-lite-preview | 1.0M | 66K | $0.25$1.5 |
| google/gemini-3.1-flash-lite | 1.0M | 66K | $0.25$1.5 |
| google/gemini-3.1-pro-preview | 1.0M | 66K | $2$12 |
/v1/models/perplexity/google/gemini-3.1-flash-lite-previewOur most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is best for high-volume agentic tasks, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.