GLM-4.6

by ZAI

Specifications

Input
Output
Context window: 203K tokens
Released: Sep 2025

Performance

Speed: 26 t/s
TTFT: 1.2s
Latency: 24.0s
Intelligence: —
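The three timing figures above appear to be related. As a rough sketch, assuming that latency means time to first token plus generation time at the listed speed (an assumption, since the page does not define these metrics), the listed 24.0s latency would correspond to a response of roughly 590 output tokens. The function names below are illustrative only.

```python
# Listed figures from the Performance section.
SPEED_TOKENS_PER_S = 26.0
TTFT_S = 1.2
LATENCY_S = 24.0


def estimated_latency_s(output_tokens: int,
                        ttft_s: float = TTFT_S,
                        tokens_per_s: float = SPEED_TOKENS_PER_S) -> float:
    """Estimate end-to-end latency, ASSUMING latency = TTFT + tokens / speed."""
    return ttft_s + output_tokens / tokens_per_s


def implied_output_tokens(latency_s: float = LATENCY_S,
                          ttft_s: float = TTFT_S,
                          tokens_per_s: float = SPEED_TOKENS_PER_S) -> float:
    """Invert the same assumed model to get the response length behind a latency figure."""
    return (latency_s - ttft_s) * tokens_per_s


if __name__ == "__main__":
    print(f"Implied response length: {implied_output_tokens():.0f} tokens")
    print(f"Estimated latency for 1000 tokens: {estimated_latency_s(1000):.1f}s")
```

Under the same assumption, longer responses scale linearly: each additional 26 output tokens adds about one second.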

Pricing

Input: €0.40
Output: €1.60
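The page does not state the unit for these prices. A minimal sketch of a per-request cost estimate follows, assuming the common convention that prices are quoted per 1 million tokens; the constant and function names are hypothetical.

```python
# Listed prices; the per-1M-token unit is an ASSUMPTION, not stated on the page.
INPUT_EUR_PER_MTOK = 0.40
OUTPUT_EUR_PER_MTOK = 1.60
TOKENS_PER_MTOK = 1_000_000


def request_cost_eur(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in euros under the assumed unit."""
    input_cost = input_tokens * INPUT_EUR_PER_MTOK / TOKENS_PER_MTOK
    output_cost = output_tokens * OUTPUT_EUR_PER_MTOK / TOKENS_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 2K-token answer.
    print(f"€{request_cost_eur(100_000, 2_000):.4f}")
```

Output tokens cost four times as much as input tokens at these rates, so long generations dominate the bill well before long prompts do.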

About this model

GLM-4.6 is a frontier-scale Mixture-of-Experts model with 355B parameters, a 200K-token context window, and up to 128K output tokens. It is MIT licensed, making it the only model in its class that enterprises can self-host and deeply customize. It ranks #1 on LiveCodeBench v6 (82.8%) and on HLE, and #3 on AIME 2025 (93.9%) and Terminal-Bench (40.5%). It reaches near parity with Claude Sonnet 4 (48.6% win rate) while substantially outperforming other open-source baselines. The model is purpose-built for agentic workflows, real-world coding, and tool-augmented problem-solving, and it supports native tool calling during inference for complex multi-step tasks.
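The context and output limits in the description can be turned into a simple pre-flight check. This is a sketch under two assumptions that the page does not confirm: that "K" means 1,000 tokens, and that generated tokens count against the same context window as the prompt. The function name is hypothetical.

```python
# Limits taken from the description; "K" = 1,000 is an ASSUMPTION.
CONTEXT_WINDOW_TOKENS = 200_000
MAX_OUTPUT_TOKENS = 128_000


def fits_budget(prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within both limits.

    ASSUMES prompt and output share one context window.
    """
    if max_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    print(fits_budget(150_000, 40_000))   # within both limits
    print(fits_budget(150_000, 60_000))   # exceeds the shared window
    print(fits_budget(10_000, 130_000))   # exceeds the output cap
```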

Technical data

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid (off by default)

Knowledge horizon

Released: Sep 2025

Since release: 9 mo

See also

DeepSeek V4 Flash

MiniMax M2.5

Mistral Medium 3.5 128B