Gemma 4 26B A4B
Specifications
- Input
- Output
- Context window
- 256K tokens
- Released
- Apr 2026
Performance
- Speed
- 53 t/s
- TTFT
- —
- Latency
- 84 ms
- Intelligence
- —
Pricing
- Input
- €0.25 per 1M tokens
- Output
- €0.50 per 1M tokens
About this model
Google Gemma 4 26B A4B is a 25.2B parameter Mixture-of-Experts (MoE) multimodal language model with a 256K token context window. It supports text, image, and video inputs and generates text output, featuring a configurable thinking mode for step‑by‑step reasoning. The model achieves 82.6% on MMLU Pro, 77.1% on LiveCodeBench v6, and 86.3% on MMMLU, demonstrating strong performance across reasoning and multimodal benchmarks. It is released under the Apache 2.0 license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Knowledge cutoff Jan 2025
Released Apr 2026
Today
Training to release 15 mo Since release 2 mo
See also
Add Model to Comparison
Search for a model to add