DeepSeek-V4-Flash Preview

Name: DeepSeek-V4-Flash Preview
Author: Azure AI Foundry

License: MIT

DeepSeek-V4-Flash is a hybrid model with both non-thinking and thinking modes; thinking is the default.

Context

1M

Max output

384K

Input price

$0.14/1M tokens

Output price

$0.28/1M tokens

Capabilities

vision

tool call

structured output

reasoning

json mode

streaming

fine tuning

batch

Details

Model ID: deepseek-v4-flash

Provider: Azure AI Foundry

Creator: DeepSeek

Family: deepseek

License: MIT

Parameters: —

Status: active

Input modalities: text

Output modalities: text

Architecture: —

Knowledge cutoff: —

Training data cutoff: —

Release date: 2026-04-24

Deprecation date: —

Type: chat

Reasoning tokens: Yes

Max input: —

Open weight: Yes

Source: official

Last updated: 2026-05-14

Tools

Function Calling (function_calling)

Call external functions and APIs
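
As a rough illustration of what function calling involves on the application side, the sketch below dispatches a tool call to a local Python function. It assumes the model returns a tool name plus a JSON string of arguments; that shape, the `get_weather` tool, and the `dispatch_tool_call` helper are hypothetical and not taken from this listing.

```python
import json
from typing import Any, Callable, Dict


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would call an external API."""
    return f"Weather for {city}: unavailable in this sketch"


# Registry mapping tool names (as the model would emit them) to callables.
TOOLS: Dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch_tool_call(name: str, arguments_json: str) -> Any:
    """Look up a tool by name and call it with JSON-decoded keyword arguments.

    Assumes arguments arrive as a JSON object string; this is an assumption
    about the response shape, not something the listing specifies.
    """
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    kwargs = json.loads(arguments_json)
    if not isinstance(kwargs, dict):
        raise TypeError("tool arguments must decode to a JSON object")
    return TOOLS[name](**kwargs)
```

The tool's return value would then typically be sent back to the model as a follow-up message; the exact message format depends on the serving API.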

Pricing

Input

$0.14/1M tokens

Output

$0.28/1M tokens

Cache write

—

Cache read

Batch in

—

Batch out

—
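
The listed rates translate into a per-request cost as sketched below. The function name `estimate_cost` is illustrative, and the sketch assumes reasoning tokens are counted as output tokens, which the listing does not state. Cache and batch rates are not listed, so they are left out.

```python
from decimal import Decimal

# Rates from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.14")
OUTPUT_PRICE_PER_M = Decimal("0.28")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request.

    Assumption: output_tokens includes any reasoning tokens.
    Decimal is used to avoid binary floating-point rounding in prices.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M
```

For example, `estimate_cost(200_000, 50_000)` covers a 200K-token prompt with a 50K-token response.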

Family Comparison: deepseek

Model	Capabilities	Performance	Speed	Context	Max output
DeepSeek-V3-0324 Preview	RVTS	—	—	—	—
DeepSeek-V3.1 Preview	RVTS	—	—	—	—
DeepSeek-V3.2-Speciale Preview	RVTS	—	—	—	—
DeepSeek-V3.2 Preview	RVTS	—	—	—	—
DeepSeek-V4-Fast Preview	RVTS	—	—	—	—
DeepSeek-V4-Flash Preview	RVTS	—	—	1M	384K
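
The context and max-output figures in the last row can be combined into a simple budget check. This is a sketch under two assumptions that the listing does not confirm: that "1M" and "384K" mean 1,000,000 and 384,000 tokens, and that the context window is shared between prompt and output. The helper name `max_output_budget` is hypothetical.

```python
# Limits from the table row for DeepSeek-V4-Flash Preview.
# Assumption: "1M" = 1,000,000 tokens and "384K" = 384,000 tokens.
CONTEXT_WINDOW = 1_000_000
MAX_OUTPUT = 384_000


def max_output_budget(prompt_tokens: int) -> int:
    """Return how many output tokens can still be requested for a prompt.

    Assumption: prompt and output share the same context window, so the
    budget is capped by both the remaining context and the output limit.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))
```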

API

GET /v1/models/azure/deepseek-v4-flash
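
A minimal sketch of building this request with the Python standard library follows. The base URL is a placeholder, since the listing gives only the path, and `build_model_request` is an illustrative name. Authentication headers and the response schema are not described here, so the sketch does not set or parse them.

```python
from urllib.parse import quote
from urllib.request import Request


def build_model_request(
    base_url: str,
    provider: str = "azure",
    model_id: str = "deepseek-v4-flash",
) -> Request:
    """Build (but do not send) a GET request for the model metadata endpoint.

    base_url is an assumption supplied by the caller; only the path
    /v1/models/{provider}/{model_id} comes from the listing.
    """
    path = f"/v1/models/{quote(provider, safe='')}/{quote(model_id, safe='')}"
    return Request(base_url.rstrip("/") + path, method="GET")
```

The returned object can be passed to `urllib.request.urlopen` once the real host and any required credentials are known.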