Gemma 4 26B A4B

by Google

Specifications

Input: text, image, video
Output: text
Context window: 256K tokens
Released: Apr 2026

Performance

Speed: 227 t/s
TTFT: 336 ms
Latency: 79 ms
Intelligence: —
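
As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates wall-clock time for a response as time to first token plus steady-state decoding. The function name and the simple additive model are assumptions made for illustration and are not taken from the provider. The separate "Latency" figure is left out because the page does not define how it relates to TTFT.

```python
def estimate_response_seconds(
    output_tokens: int,
    ttft_ms: float = 336.0,            # TTFT listed above
    tokens_per_second: float = 227.0,  # Speed listed above
) -> float:
    """Rough estimate: time to first token plus decoding at a constant rate.

    Assumption: throughput stays constant for the whole response, which
    real deployments only approximate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


if __name__ == "__main__":
    # A 1,000-token answer under the listed figures.
    print(f"{estimate_response_seconds(1000):.2f} s")
```

Under these assumptions, longer responses are dominated by decoding speed rather than TTFT.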

Pricing

Input: €0.25
Output: €0.50
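
The prices above do not state their billing unit. The sketch below shows how a request cost could be estimated if the prices are quoted per 1,000,000 tokens. That unit is an assumption, not something the page confirms, so it is exposed as a parameter. The function name is hypothetical.

```python
def estimate_cost_eur(
    input_tokens: int,
    output_tokens: int,
    input_price_eur: float = 0.25,       # Input price listed above
    output_price_eur: float = 0.50,      # Output price listed above
    tokens_per_price_unit: int = 1_000_000,  # ASSUMPTION: unit not stated on the page
) -> float:
    """Estimate the cost of one request in euros under the assumed billing unit."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if tokens_per_price_unit <= 0:
        raise ValueError("tokens_per_price_unit must be positive")
    total = input_tokens * input_price_eur + output_tokens * output_price_eur
    return total / tokens_per_price_unit


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 1,000-token reply.
    print(f"EUR {estimate_cost_eur(20_000, 1_000):.4f}")
```

If the actual billing unit differs, only `tokens_per_price_unit` needs to change.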

About this model

Google Gemma 4 26B A4B is a 25.2B-parameter Mixture-of-Experts (MoE) multimodal language model with a 256K-token context window. It accepts text, image, and video inputs and generates text output, with a configurable thinking mode for step-by-step reasoning. The model scores 82.6% on MMLU Pro, 77.1% on LiveCodeBench v6, and 86.3% on MMMLU, benchmarks that cover reasoning, coding, and multilingual understanding respectively. It is released under the Apache 2.0 license.

Technical specifications

Capabilities
Input modalities: text, image, video
Output modalities: text
Reasoning: Hybrid (default on)

Knowledge horizon

Knowledge cutoff: Jan 2025
Released: Apr 2026
Training to release: 15 mo
Since release: 3 mo

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B