Command Palette
Search for a command to run

Qwen3 235B A22B Instruct

by Qwen

Specifications

Input
Output
Context window
262K tokens
Released
Jul 2025

Performance

Speed
66 t/s
TTFT
1.8s
Latency
634 ms
Intelligence

Pricing

Input
€0.20
per 1M tokens
Output
€0.60
per 1M tokens

About this model

Qwen3 235B A22B Instruct is a Mixture-of-Experts model with 235B total parameters and 22B activated, featuring 128 experts with 8 activated per token. Native 262K context extended to 1M tokens via Dual Chunk Attention. Achieves SOTA: 83.0 MMLU-Pro, 70.3 AIME25, 41.8 ARC-AGI, 79.2 Arena-Hard v2, 51.8 LiveCodeBench, 70.9 BFCL-v3. Non-thinking mode focused on direct task execution with enhanced instruction following, logical reasoning, and long-tail knowledge across multiple languages. Dramatically more efficient than full 235B models.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning
No

Knowledge horizon

Released Jul 2025
Today
Since release 11 mo

See also

Add Model to Comparison
Search for a model to add
Command Palette
Search for a command to run