Z.ai

GLM 4.7 Flash

z-ai/glm-4.7-flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Input

$0.078 / 1M tokens

Output

$0.52 / 1M tokens

Context

203K tokens

Capabilities

✓ Streaming✓ Tool / function calling✓ JSON / structured output○ Vision (image input)

Call this model

Pass z-ai/glm-4.7-flash as the model. Change only the base URL from your existing OpenAI SDK setup.

quickstart.sh

curl https://modeldatabase.com/v1/chat/completions \
  -H "Authorization: Bearer $MDB_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-4.7-flash",
    "messages": [{"role":"user","content":"Hello!"}]
  }'

Prices are final — the routing margin is already included. See the full pricing table or the API docs.