Google: Gemini 3.5
google/gemini-3.5-flash | Created May 19, 2026 | 1049M context | Starting at $1.5/M input tokens | Starting at $9/M output tokens
Gemini 3.5 Flash is Google's new-generation core AI model, launched in May 2026. It emphasizes fast processing, strong code-writing capabilities, and the ability to act as an autonomous AI agent. Although positioned in the lightweight "Flash" series, it outperforms the previous generation's flagship model on many benchmark tests, taking a significant step toward the role of a "mobile executor" while maintaining a high cost-performance ratio.
Providers for Gemini 3.5
Routes requests to the best providers that are able to handle your prompt size and parameters.
| Provider | Context Window | Max Output | Input Price | Output Price | Cache Read | Cache Write |
|---|---|---|---|---|---|---|
| GCP Vertex | 1048576K | 65536K | $1.5 | $9 | $0 | — |
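
As a rough illustration of how the listed per-token prices translate into a per-request cost, the sketch below multiplies token counts by the "starting at" rates shown above ($1.5 per million input tokens, $9 per million output tokens). The function name `estimate_cost` and the example token counts are hypothetical, and the calculation assumes flat pricing with no caching or tiered rates, which may not match actual billing.

```python
# Listed "starting at" prices in USD per 1M tokens (from the listing above).
INPUT_PRICE_PER_M = 1.5
OUTPUT_PRICE_PER_M = 9.0


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD, assuming flat per-token pricing.

    Hypothetical helper: ignores cache reads/writes and any tiered rates.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    cost = estimate_cost(200_000, 4_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because the output rate is six times the input rate, long generations affect the total far more than an equally long prompt would.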