google · Recommended

Gemini 3 Flash

google/gemini-3-flash-preview

Gemini 3 Flash is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro reasoning and tool use at substantially lower latency, with a 1M-token context window and multimodal inputs including text, images, audio, video, and PDFs. It supports configurable thinking levels, structured output, tool use, and automatic context caching.

Tool calling, Structured output, Reasoning, Vision

Context: 1M (1,048,576 tokens)
Max output: 25K (25,000 tokens)
Input price: $0.50 / 1M tokens (500 Gold Karma / 1M)
Output price: $3.00 / 1M tokens (3,000 Gold Karma / 1M)
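
The listed prices can be turned into a rough per-request estimate. The sketch below assumes both prices are quoted per 1M tokens and that billing is linear in token count. It ignores any context-caching discount, since the listing does not state one. The function name `estimate_cost_usd` is hypothetical and not part of any Deva SDK.

```python
from decimal import Decimal

# Rates copied from the listing; assumed to be USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.50")
OUTPUT_USD_PER_M = Decimal("3.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from its token counts.

    Assumes linear pricing with no caching discount (an assumption,
    not something the listing confirms).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M
```

For example, `estimate_cost_usd(200_000, 25_000)` covers a 200K-token prompt with a reply at the listed 25,000-token output cap. Decimal arithmetic is used so that small per-request amounts do not pick up floating-point rounding noise.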

Quick start

Drop-in requests for the OpenAI-compatible Deva endpoint.

curl https://api.deva.me/v1/chat/completions \
  -H "Authorization: Bearer $DEVA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3-flash-preview",
    "messages": [{"role":"user","content":"Hello from Deva"}],
    "stream": true
  }'
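
The same request can be sketched in Python using only the standard library. The URL, headers, model ID, and body mirror the curl command. The streaming parser is an assumption: it expects OpenAI-style server-sent events (`data: {...}` lines with a `choices[].delta.content` field, ending in `data: [DONE]`), which this page implies by calling the endpoint OpenAI-compatible but does not spell out. The helper names `build_request`, `iter_text`, and `stream_reply` are hypothetical.

```python
import json
import os
import urllib.request
from typing import Iterable, Iterator

API_URL = "https://api.deva.me/v1/chat/completions"
MODEL = "google/gemini-3-flash-preview"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request as the curl example."""
    body = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "stream": True,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def iter_text(lines: Iterable[bytes]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style SSE lines (assumed format)."""
    for raw in lines:
        line = raw.decode("utf-8").strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and SSE comments
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


def stream_reply(prompt: str) -> None:
    """Send the prompt and print the reply as it streams in."""
    request = build_request(prompt, os.environ["DEVA_API_KEY"])
    with urllib.request.urlopen(request) as response:
        for piece in iter_text(response):
            print(piece, end="", flush=True)
    print()
```

Calling `stream_reply("Hello from Deva")` with `DEVA_API_KEY` set in the environment would send the request. Keeping `iter_text` separate from the network call lets the parsing be checked against canned lines without touching the API.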

Capabilities

Feature metadata advertised for this model.

Tool calling, Structured output, Reasoning, Vision, Streaming
