The model supports text, image, video, audio, and PDF inputs, and is designed for high-volume agentic workflows, simple data extraction, and applications where latency and API cost are the primary constraints.
| Model | Context (tokens) | Max output (tokens) | Pricing (input / output) |
|---|---|---|---|
| google/gemini-3.1-flash-lite-preview | 1.0M | 66K | $0.25 / $1.5 |
| google/gemini-3.1-flash-lite | 1.0M | 66K | $0.25 / $1.5 |
| google/gemini-3.1-pro-preview | 1.0M | 66K | $2 / $12 |
/v1/models/perplexity/google/gemini-3.1-flash-lite
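
Since API cost is named as a primary constraint, a rough per-request estimate can help when choosing between the listed models. The sketch below rests on two assumptions the table does not state: that each pricing pair is input price followed by output price, and that both are in USD per 1M tokens. The `PRICES` mapping and the `estimate_cost` helper are hypothetical names for illustration, not part of any API.

```python
from decimal import Decimal

# ASSUMPTION: each pair is (input price, output price) in USD per 1M tokens.
# The source table lists the two figures without labels or units.
PRICES = {
    "google/gemini-3.1-flash-lite-preview": (Decimal("0.25"), Decimal("1.5")),
    "google/gemini-3.1-flash-lite": (Decimal("0.25"), Decimal("1.5")),
    "google/gemini-3.1-pro-preview": (Decimal("2"), Decimal("12")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return an estimated USD cost for one request (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Compare the same workload across the listed models.
    for name in PRICES:
        print(name, estimate_cost(name, input_tokens=100_000, output_tokens=10_000))
```

`Decimal` is used rather than `float` so that small per-token amounts do not pick up binary rounding error when summed over many requests.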