DeepSeek V4 Pro
Specifications
- Input
- Output
- Context window: 1M tokens
- Released: Apr 2026
Performance
- Speed: 17 tokens/s
- TTFT (time to first token): 273 ms
- Latency: 436 ms
- Intelligence: n/a
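The speed and TTFT figures can be combined into a rough wall-clock estimate for a streamed response. The sketch below is a back-of-the-envelope model, not a measurement: it assumes the decode rate stays constant at the listed speed, and the function name `estimate_response_seconds` is hypothetical. The separately listed latency figure is not used, since the page does not define it.

```python
# Figures copied from the Performance list; real throughput varies with load.
SPEED_TOKENS_PER_S = 17.0   # listed output speed
TTFT_S = 0.273              # listed time to first token, in seconds


def estimate_response_seconds(output_tokens: int) -> float:
    """Hypothetical helper: TTFT plus decode time at a constant rate.

    Assumes a steady decode rate, which is a simplification.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_S + output_tokens / SPEED_TOKENS_PER_S


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} output tokens: about {estimate_response_seconds(n):.1f} s")
```

Under these assumptions, decode time dominates for anything beyond a few tokens, so the listed speed matters more than TTFT for long answers.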
Pricing
- Input: €1.60 per 1M tokens
- Output: €3.20 per 1M tokens
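As a worked example of the per-token pricing, the sketch below estimates the cost of a single request. It assumes billing is strictly linear in token counts, with no caching discounts, minimum charges, or taxes, none of which the listing specifies. The helper name `estimate_cost_eur` is hypothetical.

```python
from decimal import Decimal

# Prices copied from the Pricing list (EUR per 1M tokens).
INPUT_EUR_PER_M = Decimal("1.60")
OUTPUT_EUR_PER_M = Decimal("3.20")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate for one request.

    Assumes no discounts or surcharges beyond the listed rates.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_EUR_PER_M / TOKENS_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_EUR_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # A long-context request: 200k tokens in, 4k tokens out.
    print(f"EUR {estimate_cost_eur(200_000, 4_000)}")
```

`Decimal` is used instead of floats so that small per-request amounts add up without binary rounding error when summed over many calls.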
About this model
DeepSeek V4 Pro is a 1.6T-parameter Mixture-of-Experts (MoE) chat model from DeepSeek AI. Its hybrid attention architecture combines compressed sparse attention with heavily compressed attention and supports a context window of one million tokens. The model scores 90.1% EM on MMLU, 76.8% Pass@1 on HumanEval, and 51.5% EM on LongBench-V2, benchmarks that cover general language, coding, and long-context tasks respectively. It is released under the MIT License.
Technical data
- Capabilities
- Input modalities
- Output modalities
- Reasoning: Hybrid, on by default
Knowledge horizon
- Released: Apr 2026
- Time since release: 2 months