gemini-3.1-flash-lite-preview

Name: gemini-3.1-flash-lite-preview
Author: Google AI

proprietary

Our most cost-efficient multimodal model, offering the fastest performance for high-frequency, lightweight tasks. Gemini 3.1 Flash-Lite is best for high-volume agentic tasks, simple data extraction, and extremely low-latency applications where budget and speed are the primary constraints.

Context

1.0M

Max output

66K

Input price

$0.25/1M tokens

Output price

$1.5/1M tokens

Capabilities

vision

tool call

structured output

reasoning

json mode

streaming

fine tuning

batch

Details

ProviderGoogle AI

Creatorgoogle

Familygemini-3.1

Licenseproprietary

Parameters—

Statusactive

Input modalitiestext, image, video, audio

Output modalitiestext, image, audio

Architecture—

Knowledge cutoff2025-01

Training data cutoff—

Release date—

Deprecation date—

Typechat

Reasoning tokens—

Max input—

Open weightNo

Sourceofficial

Last updated2026-03-27

Tools

Function Callingfunction_calling

Call external functions and APIs

Endpoints

Generate ContentPOST

Generate text from multimodal inputhttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:generateContent

Stream ContentPOST

Stream text generation responseshttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:streamGenerateContent

Pricing

Text tokensPer 1M tokens

	Input	Cached input	Output
Standard	$0.25	$0.025	$1.5
Batch	$0.125	—	$0.75

Family Comparison: gemini-3.1

Model	Caps	Perf	Speed	Context	Max out	Pricing
gemini-3.1-flash-lite-preview	RVTS	—	—	1.0M	66K	$0.25$1.5
gemini-3.1-flash-lite	RVTS	—	—	1.0M	66K	$0.25$1.5
gemini-3.1-flash-image-preview	RVTS	—	—	131K	33K	$0.5$3
gemini-3.1-flash-live-preview	RVTS	—	—	131K	66K	$0.75$4.5
gemini-3.1-flash-tts-preview	RVTS	—	—	8K	16K	$1$20
gemini-3.1-pro-preview	RVTS	—	—	1.0M	66K	$2$12

API

GET/v1/models/google/gemini-3.1-flash-lite-preview