Qwen3 235B A22B Instruct

by Qwen

Specifications

Input
Output
Context window: 262K tokens
Released: Jul 2025

Performance

Speed: 66 t/s
TTFT: 1.8s
Latency: 634 ms
Intelligence: —

Pricing

Input: €0.20
Output: €0.60

About this model

Qwen3 235B A22B Instruct is a Mixture-of-Experts model with 235B total parameters and 22B activated, featuring 128 experts with 8 activated per token. Native 262K context extended to 1M tokens via Dual Chunk Attention. Achieves SOTA: 83.0 MMLU-Pro, 70.3 AIME25, 41.8 ARC-AGI, 79.2 Arena-Hard v2, 51.8 LiveCodeBench, 70.9 BFCL-v3. Non-thinking mode focused on direct task execution with enhanced instruction following, logical reasoning, and long-tail knowledge across multiple languages. Dramatically more efficient than full 235B models.

Technical specifications

Capabilities
Input modalities
Output modalities
Reasoning: No

Knowledge horizon

Released Jul 2025

Today

Since release 11 mo

Qwen3 235B A22B Instruct

Specifications

Performance

Pricing

About this model

Technical specifications

Knowledge horizon

See also

Kimi K2.5

Qwen 3.6 27B

Gemma 4 31B