Model comparison
Gemini 2.5 Flash vs Llama 4 Scout
Gemini 2.5 Flash
Cheap multimodal at million-token scale.
Llama 4 Scout (Meta)
Open-weights frontier model with a headline 10M-token context; the specs below list a 328K-token window.
Specs
| Metric | Gemini 2.5 Flash | Llama 4 Scout |
|---|---|---|
| Context window | 1.0M tokens | 328K tokens |
| Input $/1M tokens | $0.300 | $0.080 |
| Output $/1M tokens | $2.50 | $0.300 |
| Modalities | File · Image · Text · Audio · Video | Text · Image |
| Open weights | No | Yes |
| Released | — | Apr 2025 |
Capability differences
| Capability | Gemini 2.5 Flash | Llama 4 Scout |
|---|---|---|
| Extended thinking | Yes | No |
| Prompt caching | Yes | No |
| Open weights | No | Yes |
Key differences
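- Context window: Gemini 2.5 Flash offers 1.0M tokens; Llama 4 Scout offers 328K, roughly a third as much.
- Pricing: Gemini 2.5 Flash input costs $0.300 per 1M tokens vs $0.080 for Llama 4 Scout (about 3.75x more); output costs $2.50 vs $0.300 per 1M tokens (about 8.3x more).
- Licensing: Llama 4 Scout is open weights; Gemini 2.5 Flash is proprietary.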
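
To make the pricing and context figures concrete, here is a minimal sketch that estimates the cost of a single request from the per-token prices in the specs table and checks whether the request fits each listed context window. The `ModelSpec` class and the `request_cost` and `fits_context` helpers are hypothetical names introduced for illustration. The sketch assumes "1.0M" means 1,000,000 tokens, that input and output tokens share the context window, and that no caching discounts or separate thinking-token charges apply; actual provider billing may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Figures copied from the specs table (hypothetical container)."""
    name: str
    context_tokens: int            # listed context window, in tokens
    input_usd_per_million: float   # USD per 1M input tokens
    output_usd_per_million: float  # USD per 1M output tokens


# Assumption: "1.0M" and "328K" are read as round decimal token counts.
GEMINI_25_FLASH = ModelSpec("Gemini 2.5 Flash", 1_000_000, 0.300, 2.50)
LLAMA_4_SCOUT = ModelSpec("Llama 4 Scout", 328_000, 0.080, 0.300)


def request_cost(spec: ModelSpec, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at list price (no caching discount)."""
    total = (input_tokens * spec.input_usd_per_million
             + output_tokens * spec.output_usd_per_million)
    return total / 1_000_000


def fits_context(spec: ModelSpec, input_tokens: int, output_tokens: int) -> bool:
    """Assumes input and output tokens both count against the window."""
    return input_tokens + output_tokens <= spec.context_tokens


if __name__ == "__main__":
    # Example workload: a 100K-token prompt with a 2K-token answer.
    for spec in (GEMINI_25_FLASH, LLAMA_4_SCOUT):
        cost = request_cost(spec, 100_000, 2_000)
        ok = fits_context(spec, 100_000, 2_000)
        print(f"{spec.name}: ${cost:.4f} per request, fits context: {ok}")
```

For a 400K-token prompt, `fits_context` returns `False` for Llama 4 Scout under the listed 328K window, so the price advantage only applies to workloads that fit.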