Gemma 4 31B

by Google

Specifications

Input: text, image, video
Output: text
Context window: 256K tokens
Released: Apr 2026

Performance

Speed: 40 t/s
TTFT: 736 ms
Latency: 311 ms
Intelligence: —
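
As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates the wall-clock time for one response. It assumes tokens arrive at a steady 40 t/s once the first token has been produced; real throughput varies with load and prompt length. The separate "Latency" figure is left out because the listing does not define it, and the function name is hypothetical.

```python
# Figures taken from the listing above.
TTFT_S = 0.736         # time to first token: 736 ms
TOKENS_PER_S = 40.0    # output speed: 40 t/s


def estimate_response_seconds(output_tokens: int,
                              ttft_s: float = TTFT_S,
                              tokens_per_s: float = TOKENS_PER_S) -> float:
    """Estimate total response time as TTFT plus steady-state generation time.

    ASSUMPTION: generation runs at a constant rate after the first token.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_s + output_tokens / tokens_per_s


if __name__ == "__main__":
    for n in (100, 400, 2000):
        print(f"{n} output tokens: about {estimate_response_seconds(n):.1f} s")
```

Under these assumptions generation time dominates quickly: at 40 t/s every additional 400 output tokens adds about 10 seconds, so TTFT only matters for short replies.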

Pricing

Input: €0.10
Output: €0.30
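
The listing does not state the unit for these prices. The sketch below assumes they are quoted per 1 million tokens, which is a common convention but is not confirmed here, and shows how a per-request cost would be computed under that assumption. The function name and the example token counts are hypothetical.

```python
from decimal import Decimal

# Prices from the listing above.
INPUT_EUR = Decimal("0.10")
OUTPUT_EUR = Decimal("0.30")

# ASSUMPTION: prices are per 1,000,000 tokens; the listing does not say.
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in euros for one request."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = Decimal(input_tokens) * INPUT_EUR + Decimal(output_tokens) * OUTPUT_EUR
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # A long prompt near the context limit with a modest reply.
    print(estimate_cost_eur(200_000, 5_000))
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding error when many requests are summed.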

About this model

Google Gemma 4 31B is a dense multimodal language model with 31B parameters and a 256K-token context window. It accepts text, image, and video input and generates text output, with a configurable thinking mode for step-by-step reasoning. The model scores 85.2% on MMLU Pro, 80.0% on LiveCodeBench v6, and 88.4% on MMMLU, covering reasoning, coding, and multilingual benchmarks. It is available under the Apache 2.0 license.

Technical specifications

Capabilities
Input modalities: text, image, video
Output modalities: text
Reasoning: Hybrid (default on)

Knowledge horizon

Knowledge cutoff: Jan 2025
Released: Apr 2026
Training to release: 15 mo
Since release: 3 mo

See also

Kimi K2.6

Qwen 3.6 27B

GPT-OSS 120B