Models / Google
Google
Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Input
$0.13 / 1M tokens
Output
$0.52 / 1M tokens
Context
1049K tokens
Capabilities
✓ Streaming✓ Tool / function calling✓ JSON / structured output✓ Vision (image input)
Call this model
Pass google/gemini-2.5-flash-lite as the model. Change only the base URL from your existing OpenAI SDK setup.
quickstart.sh
curl https://modeldatabase.com/v1/chat/completions \
-H "Authorization: Bearer $MDB_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemini-2.5-flash-lite",
"messages": [{"role":"user","content":"Hello!"}]
}'
Prices are final — the routing margin is already included. See the full pricing table or the API docs.