Google · Fast · Cheapest

Gemini 2.5 Flash

Ultra-fast and ultra-cheap. Google's speed-optimized model for high-volume workloads.

Input Price
$0.15/1M tokens
Output Price
$0.60/1M tokens
Context Window
1M tokens
Latency (p50)
85 ms (avg)
Uptime
99.98% (last 30 days)

Start using now

Select a billing plan and activate this model.

Input: $0.15/1M tokens
Output: $0.60/1M tokens
Free tier: 1M tokens/month

No upfront commitment. Cancel anytime.

About this model

Gemini 2.5 Flash is designed for maximum throughput at minimum cost. With a median (p50) latency of 85 ms and pricing of $0.15 per 1M input tokens and $0.60 per 1M output tokens, it suits high-volume applications, real-time experiences, and cost-sensitive deployments that still need solid quality.
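To get a rough sense of what a high-volume workload might cost at the listed prices, the arithmetic can be sketched as below. This is a minimal estimate, not a billing calculator: the function name `estimate_cost_usd` and the example workload are hypothetical, and the free tier is ignored because the page does not say how it is applied.

```python
# Prices taken from the listing on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD for the given token counts (free tier ignored)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 10M requests, each with 200 input and 50 output tokens.
num_requests = 10_000_000
total_cost = estimate_cost_usd(num_requests * 200, num_requests * 50)
print(f"Estimated cost: ${total_cost:.2f}")
```

Because output tokens cost four times as much as input tokens, workloads with short outputs (classification, autocomplete) tend to be cheaper per request than generation-heavy ones with the same prompt size.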

What it's best for

  • High-volume API calls
  • Real-time applications
  • Cost-optimized pipelines
  • Rapid content generation

Sample use cases

1. Autocomplete and suggestions
2. Real-time classification
3. Streaming chat applications
4. Bulk data processing

Live stats

Requests today: 2.4M
Avg latency: 85 ms
Active agents: 8,432
Error rate: 0.02%