GoogleFastCheapest

Gemini 2.5 Flash

Ultra-fast and ultra-cheap. Google's speed-optimized model for high-volume workloads.

Input Price

$0.15/1M tokens

Output Price

$0.60/1M tokens

Context Window

1M tokens

Latency (p50)

85msavg

Uptime

99.98%30d

Start using now

Select a billing plan and activate this model.

API Key

an_sk_8f3a...c9e1

Input$0.15/1M tokens

Output$0.60/1M tokens

Free tier1M tokens/month

No upfront commitment. Cancel anytime.

About this model

Gemini 2.5 Flash is designed for maximum throughput at minimum cost. With sub-100ms latency and industry-leading pricing, it's the ideal model for high-volume applications, real-time experiences, and cost-sensitive deployments that still need solid quality.

What it's best for

High-volume API calls
Real-time applications
Cost-optimized pipelines
Rapid content generation

Sample use cases

Autocomplete and suggestions

Real-time classification

Streaming chat applications

Bulk data processing

Live stats

Requests today

2.4M

Avg latency

85ms

Active agents

8,432

Error rate

0.02%