gemini-2.5-flash-native-audio-preview-12-2025

Context
131K
Max output
8K
Capabilities
vision
tool call
structured output
reasoning
json mode
streaming
fine tuning
batch
Details
Provider Google AI
Creator Google AI
Statusactive
Input modalitiestext, image, video, audio
Output modalitiestext, image, audio
Knowledge cutoff2025-01
Release date
Deprecation date
Type
Reasoning tokens
Max input
Sourceofficial
Last updated2026-03-22
Tools
Function Callingfunction_calling
Call external functions and APIs
Endpoints
Generate ContentPOST
Generate text from multimodal inputhttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:generateContent
Stream ContentPOST
Stream text generation responseshttps://generativelanguage.googleapis.com/v1beta/v1beta/models/{model}:streamGenerateContent
Family Comparison: gemini-2.5
ModelContextMax outInputOutput
1.0M66K
gemini-2.5-flash-native-audio-preview-12-2025RVTS
131K8K
1.0M66K
1.0M66K$0.1$0.4
66K33K$0.3$0.04
1.0M66K$0.3$2.5
8K16K$0.5$10
8K16K$1$20
128K64K$1.25$10
1.0M66K$1.25$10
API
GET/v1/models/google/gemini-2.5-flash-native-audio-preview-12-2025