Hermes 4 405B

von NousResearch

Specifications

Input
Output
Context window: 128K tokens
Veröffentlicht: Aug 2025

Performance

Speed: 41 t/s
TTFT: 258 ms
Latency: 122 ms
Intelligence: —

Pricing

Eingabe: €0.95
Ausgabe: €2.85

Über dieses Modell

NousResearch Hermes 4 405B is the flagship hybrid-mode reasoning model based on Meta's Llama-3.1-405B architecture. Trained on a massive ~60B token corpus with explicit <think> deliberation segments, it delivers frontier-level performance in math, code, STEM, logic, and creative tasks. Achieves SOTA on RefusalBench for helpful, uncensored responses aligned to user values. Supports advanced function calling, structured JSON outputs, and tool use with extreme steerability and reduced refusal rates.

Technische Daten

Fähigkeiten
Eingabe-Modalitäten
Ausgabe-Modalitäten
Reasoning: Hybrid Standard off

Knowledge horizon

Veröffentlicht Aug 2025

Today

Since release 11 mo

Hermes 4 405B

Specifications

Performance

Pricing

Über dieses Modell

Technische Daten

Knowledge horizon

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B