GLM-4.6
Specifications
- Input
- Output
- Context window
- 203K tokens
- Veröffentlicht
- Sep 2025
Performance
- Speed
- 26 t/s
- TTFT
- 1.2s
- Latency
- 24.0s
- Intelligence
- —
Pricing
- Eingabe
- €0.40 per 1M tokens
- Ausgabe
- €1.60 per 1M tokens
Über dieses Modell
GLM-4.6 is a frontier-scale 355B parameter Mixture-of-Experts model with a 200K context window and 128K output capability. MIT licensed, making it the only model in its class that enterprises can self-host and deeply customize. Dominates LiveCodeBench v6 (#1, 82.8%), HLE (#1), excels at AIME 2025 (#3, 93.9%) and Terminal-Bench (#3, 40.5%). Near parity with Claude Sonnet 4 (48.6% win rate) while dramatically outperforming other open-source baselines. Purpose-built for agentic workflows, real-world coding, and tool-augmented problem-solving. Supports native tool calling during inference for complex multi-step tasks.
Technische Daten
- Fähigkeiten
- Eingabe-Modalitäten
- Ausgabe-Modalitäten
- Reasoning
- Hybrid Standard off
Knowledge horizon
Veröffentlicht Sep 2025
Today
Since release 9 mo
See also
Modell zum Vergleich hinzufügen
Nach einem Modell suchen