Google · Fast · Cheapest
Gemini 2.5 Flash
Ultra-fast and ultra-cheap. Google's speed-optimized model for high-volume workloads.
Input Price
$0.15/1M tokens
Output Price
$0.60/1M tokens
Context Window
1M tokens
Latency (p50)
85 ms avg
Uptime
99.98% (30d)
Start using now
Select a billing plan and activate this model.
an_sk_8f3a...c9e1
Input: $0.15/1M tokens
Output: $0.60/1M tokens
Free tier: 1M tokens/month
No upfront commitment. Cancel anytime.
About this model
Gemini 2.5 Flash is designed for maximum throughput at minimum cost. With latency around 85 ms and pricing of $0.15 per 1M input tokens and $0.60 per 1M output tokens, it suits high-volume applications, real-time experiences, and cost-sensitive deployments that still need solid quality.
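To make the pricing concrete, here is a minimal sketch of a cost estimate based on the per-token prices listed on this page. The helper name `estimate_cost` and the example workload are hypothetical, and the sketch ignores the free tier because the page does not say how it is applied.

```python
# Prices as listed on this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload (hypothetical helper, free tier ignored)."""
    input_cost = (input_tokens / 1_000_000) * INPUT_PRICE_PER_M
    output_cost = (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Made-up workload: 1M requests, each with 500 input and 100 output tokens.
requests = 1_000_000
cost = estimate_cost(requests * 500, requests * 100)
print(f"Estimated cost: ${cost:.2f}")
```

Because output tokens cost four times as much as input tokens at these rates, workloads with short responses (classification, autocomplete) benefit most.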
What it's best for
- High-volume API calls
- Real-time applications
- Cost-optimized pipelines
- Rapid content generation
Sample use cases
1. Autocomplete and suggestions
2. Real-time classification
3. Streaming chat applications
4. Bulk data processing
Live stats
Requests today
2.4M
Avg latency
85ms
Active agents
8,432
Error rate
0.02%