gemini-3.1-flash-lite-preview

proprietary

Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is best for high-volume agentic tasks, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.

Context
1.0M
Max output
66K
Input price
$0.25/1M tokens
Output price
$1.5/1M tokens

Capabilities

vision
tool call
structured output
reasoning
json mode
streaming
fine tuning
batch

Details

Provider Google AI
Creator google
Licenseproprietary
Parameters
Statusactive
Input modalitiestext, image, video, audio
Output modalitiestext, image, audio
Architecture
Knowledge cutoff
Training data cutoff
Release date
Deprecation date
Typechat
Reasoning tokens
Max input
Open weightNo
Sourceofficial
Last updated

Tools

Function Callingfunction_calling
Call external functions and APIs

Endpoints

Generate ContentPOST
Generate text from multimodal inputhttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:generateContent
Stream ContentPOST
Stream text generation responseshttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:streamGenerateContent

Pricing

Text tokensPer 1M tokens
InputCached inputOutput
Standard$0.25$0.025$1.5
Batch$0.125$0.75

Family Comparison: gemini-3.1

ModelContextMax outPricing
gemini-3.1-flash-lite-preview1.0M66K$0.25$1.5
gemini-3.1-flash-lite1.0M66K$0.25$1.5
gemini-3.1-flash-image-preview131K33K$0.5$3
gemini-3.1-flash-live-preview131K66K$0.75$4.5
gemini-3.1-flash-tts-preview8K16K$1$20
gemini-3.1-pro-preview1.0M66K$2$12

API

GET/v1/models/google/gemini-3.1-flash-lite-preview

gemini-3.1-flash-lite-preview

Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is best for high-volume agentic tasks, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.

Changes · 9 entries
gemini-3.1-flash-lite-previewupdate36c567eMar 27, 2026, 07:38 PM
statusdeprecatedactive
deprecation_date2026-03-03
gemini-3.1-flash-lite-previewupdatefa2bbbbMar 24, 2026, 05:22 AM
descriptiona Latina Français Indonesia Italiano Polski Português – B...Our most cost-efficient multimodal model, offering the fa...
taglinea Latina Français Indonesia Italiano Polski Português – B...Our most cost-efficient multimodal model, offering the fa...
gemini-3.1-flash-lite-previewupdate1cd659eMar 24, 2026, 04:19 AM
capabilitiesjson modeyes
gemini-3.1-flash-lite-previewupdateb301ebeMar 23, 2026, 11:47 PM
descriptiona Latina Français Indonesia Italiano Polski Português – B...
taglinea Latina Français Indonesia Italiano Polski Português – B...
gemini-3.1-flash-lite-previewupdate2b100acMar 23, 2026, 05:30 AM
model_typechat
licenseproprietary
page_urlhttps://ai.google.dev/gemini-api/docs/models/gemini-3.1-f...
open_weightno
gemini-3.1-flash-lite-previewupdate7a4f1f6Mar 22, 2026, 05:43 AM
modalitiesoutput["text","image"]["text","image","audio"]
pricinginput$0.25output$1.5cached input$0.025batch input$0.125batch output$0.75tiers[{"label":"Text tokens","unit":"Per 1M tokens","columns":...
gemini-3.1-flash-lite-previewupdateb50649fMar 21, 2026, 11:58 PM
tools["function_calling"]
endpoints["generateContent","streamGenerateContent"]
gemini-3.1-flash-lite-previewupdatec9a70c5Mar 21, 2026, 08:12 AM
deprecation_dateMarch 3, 20262026-03-03
knowledge_cutoffJanuary 20252025-01
gemini-3.1-flash-lite-previewcreate7f5c66fMar 21, 2026, 05:16 AM