Command Palette
Search for a command to run

Gemma 4 26B A4B

by Google

Specifications

Input
Output
Context window
256K tokens
Released
Apr 2026

Performance

Speed
53 t/s
TTFT
Latency
84 ms
Intelligence

Pricing

Input
€0.25
per 1M tokens
Output
€0.50
per 1M tokens

About this model

Google Gemma 4 26B A4B is a 25.2B parameter Mixture-of-Experts (MoE) multimodal language model with a 256K token context window. It supports text, image, and video inputs and generates text output, featuring a configurable thinking mode for step‑by‑step reasoning. The model achieves 82.6% on MMLU Pro, 77.1% on LiveCodeBench v6, and 86.3% on MMMLU, demonstrating strong performance across reasoning and multimodal benchmarks. It is released under the Apache 2.0 license.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning
Hybrid Default on

Knowledge horizon

Knowledge cutoff Jan 2025
Released Apr 2026
Today
Training to release 15 mo Since release 2 mo

See also

Add Model to Comparison
Search for a model to add
Command Palette
Search for a command to run