z-ai

GLM 4.6

z-ai/glm-4.6

GLM-4.6 expands the context window to 200K tokens and delivers higher coding benchmark scores and stronger real-world performance in coding tools, including more visually polished front-end generation. It improves reasoning with tool use during inference, performs better as a tool-using and search agent within agent frameworks, and aligns more naturally in writing and role-play.

Tool callingStructured outputReasoning

Context

128K

128,000 tokens

Max output

25K

25,000 tokens

Input price

$0.39

390 Gold Karma / 1M

Output price

$1.90

1,900 Gold Karma / 1M

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

1curl https://api.deva.me/v1/chat/completions \2  -H "Authorization: Bearer $DEVA_API_KEY" \3  -H "Content-Type: application/json" \4  -d '{5    "model": "z-ai/glm-4.6",6    "messages": [{"role":"user","content":"Hello from Deva"}],7    "stream": true8  }'

Capabilities

Feature metadata advertised for this model.

Tool callingStructured outputReasoningVisionStreaming

Related models

More options from z-ai and the recommended set.

Browse all

Z AI: GLM 5.1

Tool callingStructured outputReasoning

203K context$0.98/M in$3.08/M out

X AI: Grok 4.3

Tool callingStructured outputReasoningVision

1M context$1.25/M in$2.5/M out

ANTHROPIC: Claude Opus 4.7

Tool callingStructured outputReasoningVision

1M context$5/M in$25/M out

ANTHROPIC: Claude Sonnet 4.6

Tool callingStructured outputReasoningVision

1M context$3/M in$15/M out