MiniMax M3

by MiniMax

Specifications

Input
Output
Context window: 1M tokens
Released: Jun 2026

Performance

Speed: 82 t/s
TTFT: —
Latency: 341 ms
Intelligence: —

Pricing

Input: €0.40
Output: €2.00

About this model

MiniMax M3 is a 428B‑parameter multimodal language model with a sparse MoE architecture that supports text, image, and video inputs and generates text. It features a 1‑million token context window enabled by MiniMax Sparse Attention, delivering 9× faster prefill and 15× faster decode compared to its predecessor. The model achieves frontier‑level performance on long‑horizon agentic benchmarks and excels in coding and cowork tasks. It is released under an Other license.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid Default on

Knowledge horizon

Released Jun 2026

Today

Since release 0 mo

MiniMax M3

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

DeepSeek V4 Flash

MiniMax M2.5

Mistral Medium 3.5 128B