Model comparison
Gemini 2.5 Flash vs Llama 4 Scout
Gemini 2.5 Flash
Cheap multimodal at million-token scale.
Meta
Llama 4 Scout
Open-weights frontier with a headline 10M-token context.
Specs
| Metric | Gemini 2.5 Flash | Llama 4 Scout |
|---|---|---|
| Context window | 1.0M tokens | 10M tokens↑ |
| Input $/1M tokens | $0.300 | $0.080↑ |
| Output $/1M tokens | $2.50 | $0.300↑ |
| Modalities | File · Image · Text · Audio · Video | Text · Image |
| Open weights | No | Yes |
| Released | - | Apr 2025 |
Capability differences
| Capability | Gemini 2.5 Flash | Llama 4 Scout |
|---|---|---|
| Extended thinking | Yes | No |
| Prompt caching | Yes | No |
| Open weights | No | Yes |
Key differences
- -Gemini 2.5 Flash offers 1.0M context; Llama 4 Scout offers 10M.
- -Input pricing: Gemini 2.5 Flash at $0.300/1M vs Llama 4 Scout at $0.080/1M.
- -Llama 4 Scout is open weights; Gemini 2.5 Flash is proprietary.