Model comparison
Gemini 2.5 Flash vs Llama 4 Maverick
Gemini 2.5 Flash
Cheap multimodal at million-token scale.
Meta
Llama 4 Maverick
The bigger Llama 4 — frontier quality you can self-host.
Specs
| Metric | Gemini 2.5 Flash | Llama 4 Maverick |
|---|---|---|
| Context window | 1.0M tokens↑ | 1.0M tokens |
| Input $/1M tokens | $0.300 | $0.150↑ |
| Output $/1M tokens | $2.50 | $0.600↑ |
| Modalities | File · Image · Text · Audio · Video | Text · Image |
| Open weights | No | Yes |
| Released | — | Apr 2025 |
Capability differences
| Capability | Gemini 2.5 Flash | Llama 4 Maverick |
|---|---|---|
| Extended thinking | Yes | No |
| Prompt caching | Yes | No |
| Open weights | No | Yes |
Key differences
- —Input pricing: Gemini 2.5 Flash at $0.300/1M vs Llama 4 Maverick at $0.150/1M.
- —Llama 4 Maverick is open weights; Gemini 2.5 Flash is proprietary.