Z.ai: GLM 4.6

deprecated

Compared with GLM-4.5, this generation brings several key improvements. Longer context window: the context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex…

Context 203K
Max output 131K
Input price $0.43/1M tokens
Output price $1.74/1M tokens

Capabilities

vision
tool call
structured output
reasoning
json mode
streaming
fine tuning
batch

Details

Model ID z-ai/glm-4.6
Provider OpenRouter
Creator z-ai
Family glm-4.6
License
Parameters
Status deprecated
Input modalities text
Output modalities text
Architecture
Knowledge cutoff
Training data cutoff
Release date
Deprecation date
Type chat
Reasoning tokens
Max input
Open weight
Source official
Last updated

Tools

Function Calling (function_calling)
Call external functions and APIs

Pricing

Input $0.43
Output $1.74
Cache write
Cache read $0.08
Batch in
Batch out
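
As a rough illustration of how the listed rates translate into a per-request cost, here is a minimal sketch. It assumes the input and output prices above are in USD per 1M tokens (as stated in the summary at the top of the page) and ignores cache and batch rates. The function name `estimate_cost` is hypothetical, not part of any API.

```python
# Rates copied from the pricing listed on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.43
OUTPUT_PRICE_PER_M = 1.74


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Hypothetical helper: ignores cache-read, cache-write, and batch pricing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000
```

For example, `estimate_cost(10_000, 2_000)` comes to roughly $0.0078 under these assumptions.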

Family Comparison: glm-4.6

Model            Context  Max out  Pricing (input / output)
Z.ai: GLM 4.6V   131K     24K      $0.3 / $0.9
Z.ai: GLM 4.6    203K     131K     $0.43 / $1.74

API

GET /v1/models/openrouter/z-ai/glm-4.6
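
The endpoint above could be queried along the following lines. This is only a sketch: the page does not state the API host, so `base_url` is a placeholder the caller must supply, and it is an assumption that the endpoint needs no authentication and returns JSON. The helper names `build_model_url` and `fetch_model` are hypothetical.

```python
import json
import urllib.request


def build_model_url(base_url: str, provider: str, model_id: str) -> str:
    """Join a caller-supplied host with the /v1/models/{provider}/{model_id} path."""
    return f"{base_url.rstrip('/')}/v1/models/{provider}/{model_id}"


def fetch_model(base_url: str, provider: str, model_id: str, timeout: float = 10.0):
    """Send the GET request and parse the body as JSON.

    Assumptions: no auth header is required and the response is JSON;
    neither is confirmed by this page.
    """
    url = build_model_url(base_url, provider, model_id)
    req = urllib.request.Request(url, method="GET", headers={"Accept": "application/json"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.load(resp)
```

With a placeholder host such as `https://catalog.example.com`, `build_model_url(host, "openrouter", "z-ai/glm-4.6")` yields the path shown above appended to that host.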