GPT-OSS 120B

von OpenAI

Specifications

Input
Output
Context window: 131K tokens
Veröffentlicht: Aug 2025

Performance

Speed: 649 t/s
TTFT: 738 ms
Latency: 227 ms
Intelligence: —

Pricing

Eingabe: €0.22
Ausgabe: €0.66

Über dieses Modell

GPT-OSS 120B is a powerful 117B parameter Mixture-of-Experts reasoning model with 5.1B active parameters, released under Apache 2.0. Features configurable reasoning effort (low/medium/high), full chain-of-thought visibility, and runs on a single 80GB GPU thanks to MXFP4 quantization. Native support for function calling, web browsing, Python code execution, and structured outputs. Designed for agentic tasks and complex reasoning with production-grade performance. Fully customizable for specialized use cases on single H100/MI300X.

Technische Daten

Fähigkeiten
Eingabe-Modalitäten
Ausgabe-Modalitäten
Reasoning: Standard on high

Knowledge horizon

Wissensstand Jun 2024

Veröffentlicht Aug 2025

Today

Training to release 14 mo Since release 11 mo

GPT-OSS 120B

Specifications

Performance

Pricing

Über dieses Modell

Technische Daten

Knowledge horizon

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B