GPT-OSS 20B
Specifications
- Input
- Output
- Context window
- 131K tokens
- Veröffentlicht
- Aug 2025
Performance
- Speed
- 153 t/s
- TTFT
- 1.1s
- Latency
- 310 ms
- Intelligence
- —
Pricing
- Eingabe
- €0.03 per 1M tokens
- Ausgabe
- €0.13 per 1M tokens
Über dieses Modell
GPT-OSS 20B is a compact 21B parameter Mixture-of-Experts model with 3.6B active parameters, designed for lower latency and local deployment. Runs within 16GB memory with configurable reasoning effort, full chain-of-thought access, and native agentic capabilities including function calling and structured outputs. Released under Apache 2.0 license, ideal for specialized fine-tuning on consumer hardware. Companion model to GPT-OSS 120B optimized for speed while maintaining strong reasoning capabilities.
Technische Daten
- Fähigkeiten
- Eingabe-Modalitäten
- Ausgabe-Modalitäten
- Reasoning
- Standard on high
Knowledge horizon
Wissensstand Jun 2024
Veröffentlicht Aug 2025
Today
Training to release 14 mo Since release 10 mo
See also
Modell zum Vergleich hinzufügen
Nach einem Modell suchen