z-ai
GLM 4.6
z-ai/glm-4.6
GLM-4.6 expands the model's context window to 200K tokens (this listing advertises 128K under Context) and delivers higher coding benchmark scores and stronger real-world performance in coding tools, including more visually polished front-end generation. It improves reasoning with tool use during inference, performs better as a tool-using and search agent within agent frameworks, and produces more natural writing and role-play.
Tool calling · Structured output · Reasoning
Context
128K
128,000 tokens
Max output
25K
25,000 tokens
Input price
$0.39
390 Gold Karma / 1M
Output price
$1.90
1,900 Gold Karma / 1M
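The per-million-token prices above translate into a per-request cost with simple arithmetic. The sketch below is a minimal estimate, assuming billing is strictly linear in token counts with no minimums or extra fees; the function name `estimate_cost` is hypothetical, and the rate of 1,000 Gold Karma per dollar is inferred from the listing (390 Gold Karma next to $0.39), not stated by it.

```python
from decimal import Decimal

# Prices taken from the listing, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.39")
OUTPUT_USD_PER_M = Decimal("1.90")

# Assumption: inferred from "390 Gold Karma / 1M" shown next to "$0.39".
GOLD_KARMA_PER_USD = 1000


def estimate_cost(input_tokens, output_tokens):
    """Return (usd, gold_karma) for one request, assuming linear pricing."""
    usd = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    ) / Decimal(1_000_000)
    return usd, usd * GOLD_KARMA_PER_USD
```

For example, `estimate_cost(100_000, 25_000)` prices a 100,000-token prompt with a full 25,000-token completion at $0.0865, or 86.5 Gold Karma under the inferred rate.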
Quick start
Drop-in requests for the OpenAI-compatible Deva endpoint.
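The curl request in this section can also be sketched in Python using only the standard library. This is an illustration, not an official client: the helper names (`build_request`, `extract_delta`, `stream_chat`) are hypothetical, and the stream parsing assumes the endpoint emits OpenAI-style server-sent events (`data: {...}` lines ending with `data: [DONE]`), which the listing implies by calling the endpoint OpenAI-compatible but does not spell out.

```python
import json
import os
import urllib.request

API_URL = "https://api.deva.me/v1/chat/completions"  # from the quick start


def build_request(prompt, api_key, model="z-ai/glm-4.6"):
    """Build the same POST request that the curl command sends."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_delta(line):
    """Return the text fragment carried by one streamed line, or None.

    Assumption: OpenAI-style events of the form 'data: {json}', with a
    final 'data: [DONE]' marker. Other lines are ignored.
    """
    line = line.strip()
    if not line.startswith("data:"):
        return None
    body = line[len("data:"):].strip()
    if not body or body == "[DONE]":
        return None
    choices = json.loads(body).get("choices") or []
    if not choices:
        return None
    return (choices[0].get("delta") or {}).get("content")


def stream_chat(prompt):
    """Send the request and print text fragments as they arrive."""
    req = build_request(prompt, os.environ["DEVA_API_KEY"])
    with urllib.request.urlopen(req) as resp:
        for raw in resp:
            text = extract_delta(raw.decode("utf-8"))
            if text:
                print(text, end="", flush=True)
    print()
```

With `DEVA_API_KEY` set in the environment, `stream_chat("Hello from Deva")` would send the same payload as the curl command and print the reply incrementally.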
curl https://api.deva.me/v1/chat/completions \
  -H "Authorization: Bearer $DEVA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-4.6",
    "messages": [{"role":"user","content":"Hello from Deva"}],
    "stream": true
  }'

Capabilities
Feature metadata advertised for this model.
Tool calling · Structured output · Reasoning · Vision · Streaming
Related models
More options from z-ai and the recommended set.
Z AI: GLM 5.1
Tool calling · Structured output · Reasoning
203K context · $0.98/M in · $3.08/M out
X AI: Grok 4.3
Tool calling · Structured output · Reasoning · Vision
1M context · $1.25/M in · $2.50/M out
ANTHROPIC: Claude Opus 4.7
Tool calling · Structured output · Reasoning · Vision
1M context · $5/M in · $25/M out
ANTHROPIC: Claude Sonnet 4.6
Tool calling · Structured output · Reasoning · Vision
1M context · $3/M in · $15/M out