Mistral Medium 3.5 128B
Specifications
- Input
- Output
- Context window
- 256K tokens
- Released
- Apr 2026
Performance
- Speed
- 31 t/s
- TTFT
- —
- Latency
- 655 ms
- Intelligence
- —
Pricing
- Input
- €1.50 per 1M tokens
- Output
- €5.00 per 1M tokens
About this model
Mistral Medium 3.5 128B is a dense 128‑billion‑parameter Mistral‑3 model with a 256k context window. It supports multimodal input (text and images) and offers instruction‑following, reasoning, and coding capabilities with native function calling and JSON output. In benchmarks it achieves 91.4% on τ³‑Telecom and 77.6% on SWE‑Bench Verified, demonstrating strong agentic performance. The model is released under a Modified MIT License.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default off
Knowledge horizon
Released Apr 2026
Today
Since release 2 mo
See also
Add Model to Comparison
Search for a model to add