google/gemini-3.1-flash-lite

proprietary

The model supports text, image, video, audio, and PDF inputs, and is designed for high-volume agentic workflows, simple data extraction, and applications where latency and API cost are the primary constraints.

Context
1.0M
Max output
66K
Input price
$0.25/1M tokens
Output price
$1.5/1M tokens

Capabilities

vision
tool call
structured output
reasoning
json mode
streaming
fine tuning
batch

Details

Provider Perplexity
Creator Google AI
Licenseproprietary
Parameters
Statusactive
Input modalitiestext
Output modalitiestext
Architecture
Knowledge cutoff
Training data cutoff
Release date
Deprecation date
Typechat
Reasoning tokens
Max input
Open weightNo
Sourceofficial
Last updated

Pricing

Input
$0.25
Output
$1.5
Cache write
Cache read
Batch in
Batch out

Family Comparison: gemini-3.1

ModelContextMax outPricing
google/gemini-3.1-flash-lite-preview1.0M66K$0.25$1.5
google/gemini-3.1-flash-lite1.0M66K$0.25$1.5
google/gemini-3.1-pro-preview1.0M66K$2$12

API

GET/v1/models/perplexity/google/gemini-3.1-flash-lite