GPT-OSS 20B

by OpenAI

Specifications

Input
Output
Context window: 131K tokens
Released: Aug 2025

Performance

Speed: 153 t/s
TTFT: 1.4s
Latency: 306 ms
Intelligence: —

Pricing

Input: €0.03
Output: €0.13

About this model

GPT-OSS 20B is a compact 21B parameter Mixture-of-Experts model with 3.6B active parameters, designed for lower latency and local deployment. Runs within 16GB memory with configurable reasoning effort, full chain-of-thought access, and native agentic capabilities including function calling and structured outputs. Released under Apache 2.0 license, ideal for specialized fine-tuning on consumer hardware. Companion model to GPT-OSS 120B optimized for speed while maintaining strong reasoning capabilities.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: Default on high

Knowledge horizon

Knowledge cutoff Jun 2024

Released Aug 2025

Today

Training to release 14 mo Since release 11 mo

GPT-OSS 20B

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

Kimi K2.6

Qwen 3.6 27B

Gemma 4 31B