GPT-OSS 120B

by OpenAI

Specifications

Input
Output
Context window: 131K tokens
Released: Aug 2025

Performance

Speed: 752 t/s
TTFT: 738 ms
Latency: 4.6s
Intelligence: —

Pricing

Input: €0.22
Output: €0.66

About this model

GPT-OSS 120B is a powerful 117B parameter Mixture-of-Experts reasoning model with 5.1B active parameters, released under Apache 2.0. Features configurable reasoning effort (low/medium/high), full chain-of-thought visibility, and runs on a single 80GB GPU thanks to MXFP4 quantization. Native support for function calling, web browsing, Python code execution, and structured outputs. Designed for agentic tasks and complex reasoning with production-grade performance. Fully customizable for specialized use cases on single H100/MI300X.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Default on high

Knowledge horizon

Knowledge cutoff Jun 2024

Released Aug 2025

Today

Training to release 14 mo Since release 10 mo

GPT-OSS 120B

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

Kimi K2.5

Qwen 3.6 27B

Gemma 4 31B