gemini-2.5-flash-native-audio-preview-12-2025

Context

131K

Max output

Capabilities

vision

tool call

structured output

reasoning

json mode

streaming

fine tuning

batch

Details

CreatorGoogle AI

Statusactive

Input modalitiestext, image, video, audio

Output modalitiestext, image, audio

Knowledge cutoff2025-01

Release date—

Deprecation date—

Type—

Reasoning tokens—

Max input—

Sourceofficial

Last updated2026-03-22

Tools

Function Callingfunction_calling

Call external functions and APIs

Endpoints

Generate ContentPOST

Generate text from multimodal inputhttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:generateContent

Stream ContentPOST

Stream text generation responseshttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:streamGenerateContent

Family Comparison: gemini-2.5

Model	Perf	Speed	Context	Max out	Input	Output
gemini-2.5-flash-image-previewVS	—	—	—	—	—	—
gemini-2.5-flash-lite-preview-09-2025RVTS	—	—	1.0M	66K	—	—
gemini-2.5-flash-native-audio-preview-12-2025RVTS	—	—	131K	8K	—	—
gemini-2.5-flash-preview-05-20VS	—	—	—	—	—	—
gemini-2.5-flash-preview-09-2025RVTS	—	—	1.0M	66K	—	—
gemini-2.5-flash-preview-09-25VS	—	—	—	—	—	—
gemini-2.5-pro-preview-03-25VS	—	—	—	—	—	—
gemini-2.5-pro-preview-05-06VS	—	—	—	—	—	—
gemini-2.5-pro-preview-06-05VS	—	—	—	—	—	—
gemini-2.5-flash-liteRVTS	—	—	1.0M	66K	$0.1	$0.4
gemini-2.5-flash-imageRVTS	—	—	66K	33K	$0.3	$0.04
gemini-2.5-flashRVTS	—	—	1.0M	66K	$0.3	$2.5
gemini-2.5-flash-preview-ttsRVTS	—	—	8K	16K	$0.5	$10
gemini-2.5-pro-preview-ttsRVTS	—	—	8K	16K	$1	$20
gemini-2.5-computer-use-preview-10-2025RVTS	—	—	128K	64K	$1.25	$10
gemini-2.5-proRVTS	—	—	1.0M	66K	$1.25	$10

API

GET/v1/models/google/gemini-2.5-flash-native-audio-preview-12-2025