MiniMax M3
Specifications
- Input
- Output
- Context window
- 1M tokens
- Released
- Jun 2026
Performance
- Speed
- 82 t/s
- TTFT
- —
- Latency
- 341 ms
- Intelligence
- —
Pricing
- Input
- €0.40 per 1M tokens
- Output
- €2.00 per 1M tokens
About this model
MiniMax M3 is a 428B‑parameter multimodal language model with a sparse MoE architecture that supports text, image, and video inputs and generates text. It features a 1‑million token context window enabled by MiniMax Sparse Attention, delivering 9× faster prefill and 15× faster decode compared to its predecessor. The model achieves frontier‑level performance on long‑horizon agentic benchmarks and excels in coding and cowork tasks. It is released under an Other license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Released Jun 2026
Today
Since release 0 mo
See also
Add Model to Comparison
Search for a model to add