GLM 5.2
Specifications
- Input
- Output
- Context window
- 1M tokens
- Released
- Feb 2026
Performance
- Speed
- 76 t/s
- TTFT
- 1.6s
- Latency
- 228 ms
- Intelligence
- —
Pricing
- Input
- €1.50 per 1M tokens
- Output
- €4.50 per 1M tokens
About this model
ZAI GLM 5.2 is a 744B parameter Mixture-of-Experts (MoE) language model built on the GLM-MoE DSA architecture. It features a solid 1‑million token context window and multiple thinking effort levels for flexible reasoning and coding performance. The model achieves strong benchmark results, e.g., 54.7 on HLE (with tools), 99.2 on AIME 2026, and 91.2 on GPQA‑Diamond. It also excels in agentic tasks such as MCP‑Atlas (76.8) and Tool‑Decathlon (48.2). GLM 5.2 is released under the MIT license.
Technical specifications
- Capabilities
- Input modalities
- Output modalities
- Reasoning
- Hybrid Default on
Knowledge horizon
Released Feb 2026
Today
Since release 4 mo
See also
Add Model to Comparison
Search for a model to add