Hermes 4 405B
Specifications
- Input
- Output
- Context window: 128K tokens
- Released: Aug 2025
Performance
- Speed: 41 tokens/s
- TTFT (time to first token): 224 ms
- Latency: 489 ms
- Intelligence: n/a
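As a rough illustration of how the speed and TTFT figures combine, the sketch below estimates wall-clock time for a streamed reply. It assumes generation proceeds at a steady rate after the first token, which real deployments may not match; the function name `estimate_response_seconds` is hypothetical and not part of any API.

```python
# Figures taken from the Performance list; treat them as point-in-time measurements.
TTFT_SECONDS = 0.224       # time to first token (224 ms)
TOKENS_PER_SECOND = 41.0   # reported output speed


def estimate_response_seconds(output_tokens: int) -> float:
    """Estimate total response time, assuming a constant generation rate (hypothetical helper)."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Example: a 410-token reply.
estimated = estimate_response_seconds(410)
print(f"{estimated:.3f} s")
```

Under these assumptions, longer replies are dominated by the per-token rate rather than by TTFT.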
Pricing
- Input: €0.95 per 1M tokens
- Output: €2.85 per 1M tokens
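To make the per-token pricing concrete, here is a minimal sketch that turns the listed prices into a per-request estimate. The helper `estimate_cost_eur` is hypothetical, and the sketch assumes billing is strictly linear in token count with no minimums, caching discounts, or rounding rules.

```python
from decimal import Decimal

# Prices from the Pricing list (EUR per 1M tokens).
INPUT_EUR_PER_M = Decimal("0.95")
OUTPUT_EUR_PER_M = Decimal("2.85")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_eur(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in EUR, assuming purely linear billing (hypothetical helper)."""
    total = (Decimal(input_tokens) * INPUT_EUR_PER_M
             + Decimal(output_tokens) * OUTPUT_EUR_PER_M)
    return total / TOKENS_PER_M


# Example: a 10,000-token prompt with a 2,000-token reply.
print(estimate_cost_eur(10_000, 2_000))
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.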
About this model
NousResearch Hermes 4 405B is the flagship hybrid-mode reasoning model based on Meta's Llama-3.1-405B architecture. It was trained on a corpus of roughly 60B tokens that includes explicit <think> deliberation segments, and it delivers frontier-level performance in math, code, STEM, logic, and creative tasks. It achieves state-of-the-art results on RefusalBench, favoring helpful, uncensored responses aligned to user values. It also supports function calling, structured JSON output, and tool use, with high steerability and reduced refusal rates.
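When reasoning mode is on, a client will usually want to separate the deliberation from the final answer. The sketch below shows one way to do that, assuming the deliberation is wrapped in `<think>...</think>` (the closing tag is an assumption; the description above only names `<think>`). The helper `split_reasoning` is hypothetical.

```python
import re

# Assumed delimiter format: <think> ... </think>; DOTALL lets the block span lines.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) from raw model output (hypothetical helper)."""
    # Collect every deliberation segment, then strip them from the visible answer.
    reasoning = "\n".join(part.strip() for part in THINK_BLOCK.findall(text))
    answer = THINK_BLOCK.sub("", text).strip()
    return reasoning, answer


sample = "<think>2 + 2 is 4.</think>The answer is 4."
reasoning, answer = split_reasoning(sample)
print(reasoning)
print(answer)
```

Output without any `<think>` block passes through unchanged as the answer, with an empty reasoning string.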
Technical details
- Capabilities
- Input modalities
- Output modalities
- Reasoning: Hybrid (off by default)
Knowledge horizon
- Released: Aug 2025
- Time since release: 10 months