GLM 5.1
Specifications
- Input
- Output
- Context window
- 203K tokens
- Released
- Apr 2026
Performance
- Speed
- 34 t/s
- TTFT
- 431 ms
- Latency
- 947 ms
- Intelligence
- —
Pricing
- Input
- €1.30 per 1M tokens
- Output
- €4.00 per 1M tokens
About this model
ZAI GLM 5.1 is a 744B parameter Mixture-of-Experts language model built with the GLM‑MoE DSA architecture. It excels at agentic engineering, achieving state-of-the-art performance on benchmarks such as HLE with tools (52.3), SWE‑Bench Pro (58.4) and AIME 2026 (95.3). The model supports extensive tool use and long‑horizon reasoning, with a large context window of up to 128K tokens. It is released under the MIT license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Released Apr 2026
Today
Since release 2 mo
See also
Add Model to Comparison
Search for a model to add