DeepSeek V4 Flash

by DeepSeek

Specifications

Input
Output
Context window: 1M tokens
Released: Apr 2026

Performance

Speed: 124 t/s
TTFT: 1.4s
Latency: 418 ms
Intelligence: —
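
As a rough illustration of how the figures above combine, the sketch below estimates end-to-end response time as time to first token plus generation time at the listed throughput. The function name and the simple additive model are assumptions made for illustration; the listing does not say how its numbers were measured, and real latency varies with load and prompt length.

```python
# Figures copied from the Performance block above.
SPEED_TOKENS_PER_SECOND = 124.0  # "Speed: 124 t/s"
TTFT_SECONDS = 1.4               # "TTFT: 1.4s"


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate total response time for a given number of output tokens.

    Assumed model (not stated by the listing): the first token arrives
    after TTFT, then the remaining output streams at a constant rate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / SPEED_TOKENS_PER_SECOND


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} tokens: about {estimate_response_seconds(n):.1f} s")
```

Under these assumptions, short replies are dominated by the time to first token, while long replies are dominated by throughput.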

Pricing

Input: €0.15
Output: €0.30

About this model

DeepSeek V4 Flash is a 284B-parameter Mixture-of-Experts (MoE) chat model from DeepSeek AI. It uses a hybrid attention architecture that combines compressed sparse attention with heavily compressed attention, and it supports a 1-million-token context window. The model scores 88.7% EM on MMLU, 69.5% Pass@1 on HumanEval, and 44.7% EM on LongBench-V2, covering general language, coding, and long-context tasks respectively. It is released under the MIT License.

Technical details

Capabilities
Input modalities
Output modalities
Reasoning: Hybrid, on by default

Knowledge horizon

Released Apr 2026

Today

Since release 3 mo

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B