Qwen
Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Input: $0.1352 / 1M tokens
Output: $0.5408 / 1M tokens
Context: 262K tokens
Capabilities
✓ Streaming
✓ Tool / function calling
✓ JSON / structured output
✓ Vision (image input)
Call this model
Pass qwen/qwen3-vl-32b-instruct as the model. If you already use the OpenAI SDK, only the base URL needs to change.
quickstart.sh
curl https://modeldatabase.com/v1/chat/completions \
  -H "Authorization: Bearer $MDB_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3-vl-32b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
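Since the model accepts image input, a request with an image might look like the sketch below. It assumes the endpoint accepts OpenAI-style "image_url" content parts, which this page does not confirm. The helper names build_vision_payload and describe_image are hypothetical, and the image URL in the usage note is a placeholder.

```shell
#!/usr/bin/env bash

# Build a chat payload with one text part and one image part.
# Assumption: the endpoint accepts OpenAI-style "image_url" content parts.
# Arguments are inserted verbatim, so they must not contain
# double quotes or backslashes.
build_vision_payload() {
  local prompt="$1" image_url="$2"
  cat <<EOF
{
  "model": "qwen/qwen3-vl-32b-instruct",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "${prompt}"},
      {"type": "image_url", "image_url": {"url": "${image_url}"}}
    ]
  }]
}
EOF
}

# Send the payload to the same endpoint as the quickstart.
# Nothing is sent until this function is called.
describe_image() {
  curl -sS https://modeldatabase.com/v1/chat/completions \
    -H "Authorization: Bearer $MDB_API_KEY" \
    -H "Content-Type: application/json" \
    -d "$(build_vision_payload "$1" "$2")"
}
```

To try it, source the file and run describe_image "What is in this picture?" "https://example.com/cat.png". Keeping payload construction separate from the curl call makes it easy to inspect the JSON before sending anything.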
Prices are final: the routing margin is already included. See the full pricing table or the API docs.
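Because the listed prices are final, a rough cost estimate is simple arithmetic on token counts. The sketch below uses the per-million-token prices from this page. The helper estimate_cost and the example token counts are hypothetical, and real usage depends on how the service counts tokens, including for images.

```shell
#!/usr/bin/env bash

# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M=0.1352
OUTPUT_PRICE_PER_M=0.5408

# estimate_cost INPUT_TOKENS OUTPUT_TOKENS
# Prints the estimated cost in USD with six decimal places.
# Hypothetical helper, not part of any API.
estimate_cost() {
  awk -v in_tok="$1" -v out_tok="$2" \
      -v in_price="$INPUT_PRICE_PER_M" -v out_price="$OUTPUT_PRICE_PER_M" \
      'BEGIN { printf "%.6f\n", (in_tok * in_price + out_tok * out_price) / 1000000 }'
}

# Example: 100,000 input tokens and 2,000 output tokens (made-up numbers).
estimate_cost 100000 2000   # prints 0.014602
```

In this example the input side accounts for most of the cost (about $0.0135 of roughly $0.0146), even though output tokens cost four times as much per token.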