Command Palette
Search for a command to run

DeepSeek R1 0528

by Deepseek

Specifications

Input
Output
Context window
164K tokens
Released
May 2025

Performance

Speed
13 t/s
TTFT
2.2s
Latency
736 ms
Intelligence

Pricing

Input
€0.65
per 1M tokens
Output
€2.60
per 1M tokens

About this model

DeepSeek R1 0528 is an upgraded 685B parameter reasoning model with significantly enhanced depth of reasoning and inference capabilities. Achieves 87.5% on AIME 2025 (up from 70%), 91.4% on AIME 2024, 73.3% on LiveCodeBench, and 1930 Codeforces rating. Features system prompt support, averages 23K thinking tokens per question for deeper analysis, and reduced hallucination rate. Released under MIT license supporting commercial use and distillation. Performance approaching O3 and Gemini 2.5 Pro levels.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning
Default on

Knowledge horizon

Released May 2025
Today
Since release 13 mo

See also

Add Model to Comparison
Search for a model to add
Command Palette
Search for a command to run